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In memory of my father 


Preface 


This book is intended for students encountering the beautiful subject of abstract 
algebra for the first time. My goal here is to provide a text that is suitable for you, 
whether you plan to take only a single course in abstract algebra, or to carry on to 
more advanced courses at the senior undergraduate and graduate levels. Naturally, I 
wish to encourage you to study the subject further and to ensure that you are 
prepared if you do so. 

At many universities, including my own, abstract algebra is the first serious 
proof-based course taken by mathematics majors. While it is quite possible to get 
through, let us say, a course in calculus simply by memorizing a list of rules and 
applying them correctly, without really understanding why anything works, such an 
approach would be disastrous here. To be sure, you must carefully learn the defi- 
nitions and the statements of theorems, but that is nowhere near sufficient. In order 
to master the material, you need to understand the proofs and then be able to prove 
things yourself. This book contains hundreds of problems, and I cannot stress 
strongly enough the need to solve as many of them as you can. Do not be dis- 
couraged if you cannot get all of them! Some are very difficult. But try to figure out 
as many as you can. You will only learn by getting your hands dirty. 

As different universities have different sequences of courses, I am not assuming 
any prerequisites beyond the high school level. Most of the material in Part I would 
be covered in a typical course on discrete mathematics. Even if you have had such a 
course, I urge you to read through it. In particular, you absolutely must understand 
equivalence relations and equivalence classes thoroughly. (In my experience, many 
students have trouble with these concepts.) From time to time, throughout Parts II 
and III, some examples involving matrices or complex numbers appear. These can 
be bypassed if you have not studied linear algebra or complex numbers, but in any 
case, the material you need to know is not difficult and is discussed in the 
appendices. In Part IV, it is necessary to know some linear algebra, but all of the 
theorems used are proved in the text. 


Vii 


Vili Preface 


The fundamental results about groups are covered in Chaps. 3 and 4, those about 
rings are in Chaps. 8 and 9, and the introductory theorems concerning fields and 
polynomials are found in Chap. 11. I think that these chapters are essential in any 
course. Beyond that, there is a fair amount of flexibility in the choice of topics. 

I confess my first encounter with abstract algebra was a joyous experience. 
I found (and still find!) the subject fascinating, and I will consider the time I put into 
this book well spent if you emerge with an appreciation for the field. 

I would like to thank Lynn Brandon and Anne-Kathrin Birchley-Brun at 
Springer for their help in making this book a reality. Also, thanks to the reviewers 
for their many useful suggestions. I thank my wife and family for their ongoing 
support. Finally, thanks to my teacher, Prof. Sudarshan Sehgal, both for his advice 
concerning this book and for all of his help over the years. 


Thunder Bay, ON, Canada Gregory T. Lee 
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Part I 
Preliminaries 


Chapter 1 @) 
Relations and Functions eects 


We begin by introducing some basic notation and terminology. Then we discuss 
relations and, in particular, equivalence relations, which we shall see several times 
throughout the book. In the final section, we talk about various sorts of functions. 


1.1 Sets and Set Operations 


A set is a collection of objects. We will see many sorts of sets throughout this course. 
Perhaps the most common will be sets of numbers. For instance, we have the set of 
natural numbers, 

N= {152,350.45 


the set of integers, 
Z={...,—2,—-1,0,1,2,...} 


and the set of rational numbers 
a 
Q= [Fiabe Z,b 40}. 


We also write IR for the set of real numbers and C for the set of complex numbers. 
But sets do not necessarily consist of numbers. Indeed, we can consider the set of 
all letters of the alphabet, the set of all polynomials with even integers as coefficients 
or the set of all lines in the plane with positive slope. 
The objects in a set are called its elements. We write a € S if ais an element of a 
set S. Thus, —3 € Z but —3 ¢ N. The set with no elements is called the empty set, 
and denoted @. Any other set is said to be nonempty. 
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If S and T are sets, then we say that S is a subset of 7, and write S C T, if every 
element of S is also an element of 7. Of course, S C S. We say that S is a proper 
subset of T, and write SC T, if SC T but S AT. Thus, it is certainly true that 
N C Z, but we can be more precise and write N ¢ Z. 

For any two sets S and T, their intersection, SM T, is the set of all elements that 
lie in S and T simultaneously. 


Example 1.1. Let S = {1,2,3,4,5} and T = {2, 4,6, 8, 10}. Then SM T = {2, 4}. 


We can extend this notion to the intersection of an arbitrary collection of sets. If 
T is anonempty set and, for each i € J, we have a set 7;, then we write (),., T; for 
the set of elements that lie in all of the 7; simultaneously. 


iel 
Example 1.2. Foreachg € Q,letT, = {re R:r < 27}. Then (),<9 T, ={reR: 
r < O}. 


Also, for any sets S and T, their union, S U 7, is the set of all elements that lie 
in S or T (or both). 


Example 1.3. Using the same S and T as in Example 1.1, we have 
SUT = {1, 2, 3,4, 5,6, 8, 10}. 


Furthermore, if J is a nonempty set and we have a set 7; for eachi € J, then we 
write (J ;<, Zi for the union of all of the 7;; that is, the set of all elements that lie in 
at least one of the 7;. 


Example 1.4. If we use the same sets T, as in Example 1.2, we have Uso T, = R. 


In addition, for any two sets S and T, the set difference (or relative complement) 
is the set S\T = {a € S:a ¢ T}. 


Example 1.5. Once again using S and T as in Example 1.1, we have S\T = {1, 3, 5}. 


We will need one more definition. The following construction is named after René 
Descartes. 


Definition 1.1. Let S and T be any sets. Then the Cartesian product S x T is the 
set of all ordered pairs (s, t), withs € Sandt € T. 


Example 1.6. Let S = {1,2, 3} and T = {2, 3}. Then 
Sx T = {(1, 2), (1, 3), (2, 2), (2, 3), G, 2), GB, 3)}. 
There is also a Cartesian product of finitely many sets. For any sets T,, 7, ..., Th, 


welet 7, x T, x --- x T,, be the set of all ordered n-tuples (t), 4, ..., ¢,), witht; € T; 
for all 7. 
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Example 1.7. Let T; = {1, 2}, Ts = {a, b} and T3 = {2,3}. Then 7, x Ty x T3 is 
the set 


{(, a, 2), , a, 3), C1, b, 2), C1, b, 3), (2, a, 2), (2, a, 3), (2, b, 2), (2, b, 3)}. 
Exercises 
1.1. Let S = {1,2,3} and T = {3,4}. Find SAT, SUT, S\T,T\S and S x T. 


1.2. Let R = {a,b,c}, S = {a,c,d} and T = {c,e, f}. Find RNS, RN(S\T), 
SUT,SO(RUT) and R x S. 


1.3. Let R, S and T be sets with R C S. Show that RUT C SUT. 
1.4. Let S = {1,2,...,n}, for some positive integer n. Show that S has 2” subsets. 
1.5. Let R, S and T be any sets. Show that RU(SOT) = (RUS)N(RUT). 


1.6. For each positive integer n, let T, = {4 : a € Z}. 


1. What is |)”. 7,2 
2. What is (1), T,? 


1.2 Relations 


We are going to use relations (in particular, the equivalence relations and functions 
that we will see in the next two sections) quite a few times in this course. 


Definition 1.2. Let S and T be sets. Then a relation from S to T is a subset p of 
Sx T.Ifs € Sandt € T, then we write spt if (s, t) € p; otherwise, we write s f f. 
In particular, a relation on S is a relation from S to S. 


Example 1.8. Let S = {1, 2,3} and T = {1, 2, 3, 4}. Define a relation p from S to T 
via spt if and only if st? <4. Then p = {(1, 1), (1, 2), 2, D, GB, D}. In particular, 
3e1 but | A 3. 


We will focus on relations on a set. Let us discuss a few properties enjoyed by 
some relations. 


Definition 1.3. Let p be a relation on S. We say that p is reflexive if apa for all 
aeS. 


Example 1.9. On Z, the relation < is reflexive, but < is not. Indeed, a < a for all 
integers a, but | is not less than 1. 


Definition 1.4. A relation p on a set S is symmetric if apb implies boa. 
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Example 1.10. On Z, neither < nor < is symmetric, as | < 2 but 2 is not less than 
1 (and similarly for <). Define p via apb if and only if |a — b| < 10. Then p is 
symmetric. Indeed, if apb, then |a — b| < 10, and so |b — a| = |a — b| < 10; thus, 
boa. 


Definition 1.5. Let o be arelation ona set S. We say that p is transitive if, whenever 
apb and boc, we also have apc. 


Example 1.11. On Z, the relations < and < are both transitive. (Ifa < bandb < c, 
then a < c.) However, the relation p from Example 1.10 is not, since 18 and 813, 
but 1 A 13. 


These three properties lead us directly to the next section. 
Exercises 


1.7. Let S = {1, 2,3} and 7 = {3, 4,5, 6, 7, 8}. Define a relation p from S to T via 
apb if and only if |a* — b| < 1. Find all pairs (a, b) € S x T such that apb. 


1.8. Define a relation p on Z via apb if and only if ab is even. Is ¢ reflexive? 
Symmetric? Transitive? 


1.9. Define a relation p on R via apb if and only if a—beQ. Is ¢ reflexive? 
Symmetric? Transitive? 


1.10. Define a relation on R via apb if and only if a—b EN. Is ¢ reflexive? 
Symmetric? Transitive? 


1.11. 1. How many relations are there on {1, 2, 3}? 
2. How many of these relations are symmetric? 


1.12. For each of the eight subsets of {reflexive, symmetric, transitive}, find a rela- 
tion on {1, 2, 3} that has the properties in that subset, but not the properties that are 
not in the subset. 


1.3. Equivalence Relations 


Definition 1.6. An equivalence relation on a set S is a relation that is reflexive, 
symmetric and transitive. 


We will use the symbol ~ to denote an equivalence relation. 


Example 1.12. On Z, let us say thata ~ b if and only if a + bis even. We claim that 
~ is an equivalence relation. If a € Z, then a + a is certainly even, soa ~ a, and ~ 
is reflexive. If a ~ b, then a + b is even. But this also means that b + a is even, and 
hence b ~ a. Thus, ~ is symmetric. Finally, suppose that a ~ b and b ~ c. Then 
a+b and b-+c are both even. This means that their sum, a + 2b +c, is even. As 
2b is even, we see that a + c is even, and hence a ~ c. That is, ~ is transitive. 
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Example 1.13. On the set S = {a € Z: 1 <a < 20}, let a ~ b if and only if a = 
2”b for some m € Z. Let us verify that this is an equivalence relation. Reflexivity: 
Note that a = 2°a, and hence a ~ a. Symmetry: If a ~ b, say a = 2'""b, then b = 
2~"a, and hence b ~ a. Transitivity: Ifa ~ band b ~ c, saya = 2”b and b = 2"c, 
then a = 2”*"c, and therefore a ~ c. 


Example 1.14. OnR, let us say thata ~ bif and only if a — b € Z. Let us check that 
it is an equivalence relation. Reflexivity: If a € R, then a — a = 0 € Z, and hence 
a ~ a. Symmetry: Let a ~ b. Then a — b € Z, and hence b—a = —(a—b) € Z. 
Thus, b ~ a. Transitivity: Suppose that a ~ b and b~ cc. Thena—b,b—ceZ, 
and hence a — c = (a — b) + (b —c) € Z. That is,a ~ c. 


Let us try something slightly more complicated. 


Example 1.15. Let S = Z x (Z\{0}). Define ~ on S via (a, b) ~ (c, d) if and only 
if ad = bc. We must verify that ~ is an equivalence relation. Reflexivity: As ab = 
ba, we have (a, b) ~ (a, b) for all integers a and nonzero integers b. Symmetry: 
Suppose that (a, b) ~ (c,d). Thenad = bc, and this also tells us that (c, d) ~ (a, b). 
Transitivity: Let (a, b) ~ (c,d) and (c,d) ~ (e, f). Then ad = bc and cf = de. 
Thus, adf = bcf = bde. Since we are assuming that d # 0, this means that af = 
be. Therefore, (a, b) ~ (e, f). 


Equivalence relations are very special. 


Definition 1.7. Let ~ be an equivalence relation on a set S. Ifa € S, then the equiv- 
alence class of a, denoted [a], is the set {b Ee S: a ~ b}. 


Why are equivalence classes so interesting? We need another definition. 


Definition 1.8. Let S be a set, and let T be a set of nonempty subsets of S. We say 
that T is a partition of S if every a € S lies in exactly one set in T. 


Example 1.16. Let S = {1,2,3,4,5,6, 7} and T = {{1, 3, 4, 6}, {2, 7}, {5}}. Then 
T is a partition of S. 


What is the connection between these concepts? 


Theorem 1.1. Let S be a set, and ~ an equivalence relation on S. Then the equiv- 
alence classes with respect to ~ form a partition of S. In particular, if a € S, then 
a € [a] and, furthermore, a € [b] if and only if [a] = [b]. 


Proof. As ~ is reflexive, a ~ a, and hence a € [a] for every a € S. In particular, 
the equivalence classes are not empty, and every element of S is in at least one 
of them. Suppose that d € [a] M [c]. We must show that [a] = [c]. If e € [a], then 
a ~ e. Also, d € [a] means that a ~ d, and hence d ~ a by symmetry. Also, c ~ d. 
By transitivity, c ~~ a, and then c ~ e. Thus, e € [c], and therefore [a] C [c]. By 
the same argument, [c] C [a], and hence [a] = [c]. Thus, the equivalence classes 
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do indeed form a partition. To prove the final statement of the theorem, note that if 
a € [a] M [b], then [a] = [b] and, conversely, if [a] = [b], thena € [a]=[b]. O 


So, the equivalence classes break the set down into subsets having no elements 
in common. It is important to note that, unless there is only one element in an 
equivalence class, the representative chosen for that class is not unique. That is, if 
b € [a], then we could just as easily write [b] instead of [a]. They are the same class. 
This complicates matters a bit when we define operations on equivalence classes, 
as we will find ourselves doing throughout the course. We must make sure that 
our operations are well-defined; that is, that they do not depend upon the particular 
representative of the class that we use. 

Let us discuss the equivalence classes determined by the relations in our earlier 
examples. The plan is always the same. We know that each element of the set is in 
exactly one class. Thus, we will keep looking for elements of the set that are not in 
any classes we have constructed, and obtain new classes in this way. 


Example 1.17. In Example 1.12, let us start with 0. We know that a ~ 0 if and only 
if a is even. Thus, 


[O] = {..., —6, —4, —2,0, 2,4, 6,...}. 
(Note that we would have obtained the same class had we started, for instance, with 
14. Since 14 € [0], we have [0] = [14].) We have not yet found 1, so we note that 
a ~ 1 if and only if a + 1 is even; that is, if and only if a is odd. Therefore, 


as ee eee ee eee 


(Again, we could just as easily have used [—3].) We have now found all elements of 
Z. Thus, there are only two equivalence classes, [0] and [1]. 


Example 1.18. In Example 1.13, we may as well start with 1. We have 
[1] = {1, 2, 4, 8, 16}. 


As we have not yet found 3, 
[3] = {3, 6, 12}. 


We still do not have 5, and thus we take 
[5] = {5, 10, 20}. 
Similarly, we obtain 


[7] = {7, 14}, [9] = (9, 18}, (11) = {11}, 
[13] = {13}, [15] = {15}, [17] = {17}, and [19] = {19}. 


Once again, we could have used [8] in place of [1], for instance. 
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The other two examples are a bit trickier, since there are infinitely many equiva- 
lence classes. But we can attempt to describe them. 


Example 1.19. In Example 1.14, we see that b € [a] if and only if the difference 
between a and b is an integer. Thus, for instance, 


[23.86] = {..., —2.14, —1.14, —0.14, 0.86, 1.86, 2.86, ...}. 


Listing the classes is an impossible task. How, then, to describe them? We note that 
for any real number a, there is certainly an integer k such that 0 < a — k < 1. Now, 
a ~ (a —k), and hence every element of R is in a class [b], for some 0 < b < 1. 
Furthermore, if 0 < b,c < 1, then 0 < |b —c| < 1 and therefore b — c can only be 
an integer if b = c. That is, if 0 < b,c < 1 and b €c, then [b] ¥ [c]. Thus, the 
equivalence classes are precisely 


{[b]: be R, O<b <I}. 


Example 1.20. What about Example 1.15? We note that (c, d) € [(a, b)] if and only 
ifad = bc. Another way to say this is that ; = ee Thus, [(a, b)] consists of all ordered 
pairs (c,d), with c,d € Zandd ¥ 0, such that | = 4. This is, in fact, exactly how 
the rational numbers are constructed! We need to ensure that $ and : are treated as 
the same fraction, and these equivalence classes make that happen. We obtain one 
equivalence class for each fraction For instance, 


[(2, 3)] = {..., (—6, —9), (—4, —6), (—2, —3), (2, 3), (4, 6), (6, 9), ...}. 


Exercises 


1.13. Define a relation ~ on N viaa ~ bif and only if a — b = 3k, forsomek € Z. 
Is ~ an equivalence relation? If so, what are the equivalence classes? 


1.14. Define a relation ~ on {1, 2, 3, 4,5, 6, 7} viaa ~ b if and only if a and b are 
both even or both odd. Is ~ an equivalence relation? If so, what are the equivalence 
classes? 


1.15. Define a relation ~ on Z viaa ~ bif and only if |a| = |b|. Is ~ an equivalence 
relation? If so, what are the equivalence classes? 


1.16. Define a relation ~ on Z viaa ~ bif and only if ab > 0. Is ~ an equivalence 
relation? If so, what are the equivalence classes? 


1.17. Let S be the set of all subsets of Z. Define a relation ~ on S via T ~ U if and 
only if T C U. Is ~ an equivalence relation? If so, what are the equivalence classes? 


1.18. Let S be the set of all subsets of Z. Define a relation ~ on S via T ~ U if and 
only if T\U and U\T are both finite. Show that ~ is an equivalence relation and 
describe [{1, 2, 3}] and [{..., —4, —2, 0, 2,4, ...}]. 
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1.19. On the plane R’, define a relation ~ via (a, b) ~ (c, d) if and only if 3a — b = 
3c — d. Show that ~ is an equivalence relation, and describe [(4, 2)]. 


1.20. Let S be a nonempty set. Show that for any partition of S, there is an equiva- 
lence relation on S having the sets in the partition as its equivalence classes. 


1.21. Find an equivalence relation on N having exactly two equivalence classes, one 
of which contains exactly three elements. 


1.22. Suppose there is a relation p on a set S, such that p is both reflexive and 
transitive. Define ~ on S via a ~ b if and only if apb and boa. Show that ~ is an 
equivalence relation. 


1.4 Functions 


Let us give two equivalent definitions of a function. Formally, if S and T are sets, 
then a function from S to T is arelation p from S to T such that, foreach s € S, there 
is exactly one ¢ € T such that spt. In practice, nobody really thinks of functions in 
this way. The working definition follows. 


Definition 1.9. Let S and T be any sets. Then a function a : S > T is arule assign- 
ing, to each s € S, an element a(s) of T. 


Readers who have studied calculus will no doubt be familiar with functions from 
RtoR. 


Example 1.21. We can define a function a : R > R viaa(a) = 5a? — 4a* + Ja + 
3 for alla ER. 


But we do not need to go from R to R. 
Example 1.22. We can define afunctiona : Z > Qviaa(a) = (—2)* foralla € Z. 
In fact, the sets involved do not have to be sets of numbers. 


Example 1.23. Let S be the set of all English words and T the set of letters in the 
alphabet. We can define a : S — T by letting a(w) be the first letter of the word w, 
for every w € S. 


A few properties enjoyed by certain functions are important. 


Definition 1.10. A function a: § > T is one-to-one (or injective) if a(s;) = 
a(s2) implies s; = 52, for all 51, 52 € S. 


Putting this another way, a one-to-one function sends different elements to differ- 
ent places. 
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Example 1.24. Define functions and f from R to R viaa(a) = a” and B(a) = a’, 
for all a € R. Then @ is not one-to-one, since a(1) = a(—1), but 8 is one-to-one, 
since if a? = b°, then taking the cube root of both sides, we have a = b, for any 
a,beR. 


Definition 1.11. A function a : § — T is onto (or surjective) if, for every t € T, 
there exists at least one s € S such that a(s) = ¢. 


Example 1.25. Define a and f as in Example 1.24. Then q@ is not onto, since there is 
noa € R such that w(a) = —1. However, if b € R, then B(/b) = b; thus, B is onto. 


We should not get the idea that one-to-one and onto always occur together. 


Example 1.26. Define a : R > R via a(a) = 2%. Then @ is one-to-one, for if 2° = 
2°, then taking the base 2 logarithm of both sides, we see that a = b. On the other 
hand, there is no a € R such that 2“ = —1, so @ is not onto. 


However, it is nice when we can combine the two properties. 
Definition 1.12. A function a : S — T is bijective if it is one-to-one and onto. 


An equivalent way of expressing this property is that for each t € T, there is 
exactly one s € S such that w(s) = t. There must be such an 5s, since @ is onto, but if 
a(s;) = a(s2) = t, for some 51, 52 € S, then since @ is one-to-one, s; = 52. For this 
reason, a bijective function is also known as a one-to-one correspondence. 


Example 1.27. Combining Examples 1.24 and 1.25, we see that w : R > R given 
by a(a) = a’ is bijective. 


Let us discuss how to combine functions. 


Definition 1.13. Let R, S and T be sets, and let a: R—> S and 8B: S > T be 
functions. Then the composition, 6 o a, or simply Ba, is the function from R to T 
given by (Ba)(r) = B(a(r)) forallr € R. 


Note that when we write Ba, we are applying a first, then 6. The order is important! 
Indeed, depending upon the sets involved, it is possible that applying 6 first, then a, 
would not make sense. But even if it did make sense, the result would not necessarily 
be the same. 


Example 1.28. Define functions a and f from R to R viaa(a) = a* + 1 and B(a) = 
a’, for alla € R. Then (Ba)(a) = B(a? + 1) = a® + 2a? + 1, whereas (@f)(a) = 
a(a*) = a® + 1, for all a € R. That is, Ba and a are different functions. 


We can list a few important properties of the composition of functions. 
Theorem 1.2. Leta: R—> S,B:S— Tandy :T — U be functions. Then 
I. (yB)a = y (Ba); 


2. ifa and B are one-to-one, then so is Ba; 
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3. ifa and B are onto, then so is Ba; and 
4. ifa and B are bijective, then so is Ba. 


Proof. (1) Take any r € R. Then ((yB)a)(r) = (vB) (a(r)) = y(B(@(r))). Simi- 
larly, (y(Ba))(r) = y((Ba)(r)) = y(B(@(r))). 

(2) Suppose that (6a)(r1) = (Ba)(r2) for some r},72 € R. Then B(a(r1)) = 
B(a(r2)). Since B is one-to-one, a(r;) = a(r2). Since @ is one-to-one, ry = Po. 

(3) Take any ¢t € T. Since f is onto, there exists ans € S such that 6(s) = ¢. Buta 
is also onto, so there exists anv € R such that a(r) = s. Thus, (6a)(r) = B(a(r)) = 
B(s) =t. 

(4) Combine (2) and (3). 


The following additional property of bijective functions can be useful. 


Theorem 1.3. Let a: S — T be a bijective function. Then there exists a bijective 
function B : T — S such that (Ba)(s) = s for all s € S and (aB)(t) =t for all 
te’. 


Proof. Since a is bijective, for any t € T, there is a unique s € S such that a(s) = ¢. 
Define 6 : T > S via B(t) = s. By definition, we have (6a)(s) = B(a(s)) = s, for 
all s € S. Also, if t € T, then choosing s such that w(s) = t, we have B(t) = s, and 
therefore (wf) (t) = a(B(t)) = a(s) = ¢, as required. It remains to show that f is 
bijective. But if B(t,) = B(t2), then 


ty = (@B)(t1) = a(B(t)) = a(B(h)) = (@B)(h) = ha, 


so 6 is one-to-one. Furthermore, if s € S, then B(a@(s)) = (Ba)(s) = s, and hence 
B is onto. 


Example 1.29. Let a: IR > R be given by a(a) = a? for all a. Example 1.27 
showed us that @ is bijective. It is easily checked that if we let 6 : R > R be given 
by B(a) = «Ya for all a, then (wB)(a) = (Ba)(a) = a for all a. 


We close the chapter by defining two special types of functions. 
Definition 1.14. A permutation of a set S is a bijective function from S to S. 


Example 1.30. By Example 1.27, the function w : R > R given by a(a) = a? isa 
permutation of R. 


Example 1.31. Let S = {1,2,3,4}. Define a: S$ > S via a(1) = 3, a(2) = 2, 
a(3) = 4 and a(4) = 1. Then a is a permutation of S. 


As this last example illustrates, a permutation is simply a rearrangement of the 
elements of S. 


Definition 1.15. Let S be a set. Then a binary operation on S is a function from 
Sx Sto S. 
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Example 1.32. Wecan define a binary operation + on R viaa * b = 2a*b — 3b* +5, 
for all a,b € R. (Putting this in terms of functions, we could write a((a, b)) = 
2a*b — 3b4 + 5 for alla, b € R.) 


Note that in order to obtain a binary operation, we must stay within our original 


set. For instance, we would not get a binary operation on N if we tried toleta * b = ¢, 


for the simple reason that | « 2 = 5 ¢N. 
Exercises 


1.23. Define a: {1, 2, 3,4} > {1, 2, 3, 4,5, 6, 7} via w(a) = 2a — 1. Is this func- 
tion one-to-one? Is it onto? 


1.24. Define aw: R > R viaa(a) = Ya + 1 — 2. Is this function one-to-one? Is it 
onto? 


1.25. Let S be the set of real numbers and T the set of positive real numbers. Define 
a: S — T via a(a) = 234-5. Show that @ is a bijection and find B : T > S such 
that (6a)(a) = a forallae S. 


1.26. Definea: R — Rvia 


4a-—3, a<l 
a(a)=}", 
a’, a>l. 


Show that @ is bijective and find 6 : R — R such that (6a)(a) = a foralla € R. 
1.27. Which of the following are binary operations on N? 


1. axb=ab 
2. axb=a-—b 
3. axb=3 foralla andb 


1.28. Let S be a finite set, and suppose that a : S — S is a one-to-one function. 
Show that @ is a permutation of S. Construct an explicit counterexample to show that 
this need not be true if S is infinite. 


1.29. Leta: R — S and 6B: S > T be functions, and suppose that Ba is onto. 
Must a be onto? Must 6? 


1.30. Let a: R > Sand 8 : S > T be functions, and suppose that Ba is one-to- 
one. Must a be one-to-one? Must £? 


1.31. Let S be a set with m elements and T a set with n elements, for some positive 
integers m and n. 
1. How many functions are there from S to T? 


2. How many of these functions are one-to-one? 


1.32. Let S and T be sets and a : § — T a function. Show that there exist a set R 
and functions 8 : S > R and y : R > T such that 6 is onto, y is one-to-one and 


a=yf. 


Chapter 2 @) 
The Integers and Modular Arithmetic sheet 


In this chapter, we begin with a discussion of mathematical induction. Next, we 
examine a number of properties of the integers, with an emphasis on divisibility and 
prime factorization. We conclude by introducing modular arithmetic. 


2.1 Induction and Well Ordering 


We begin with an important property of the set of natural numbers. 


Property 2.1 (Well Ordering Axiom). If S isa nonempty set of positive integers, then 
S has a smallest element. 


This seems so obvious, but it is actually a rather special property of N. Indeed, Z 
has no smallest element; neither, for that matter, does the set of positive real numbers. 

There is an equivalent form of the Well Ordering Axiom that is especially useful. 
To state it, we need a definition. A proposition is a statement that is either true or 
false. For instance, “Ottawa is the capital of Canada” is a true proposition, and “There 
are only finitely many even integers” is a false one. We avoid statements having no 
truth value, such as “This statement is false” as well as statements that are a matter 
of opinion, such as “Xena: Warrior Princess was a great television program”!. What 
we would like to do is define a sequence of propositions, P(1), P(2), P(3) and so 
on, and prove that all of them are true at once. This is where induction comes in. 


Theorem 2.1 (Principle of Mathematical Induction). Suppose that, for each pos- 
itive integer n, we have a proposition P(n). Further suppose that 


‘Of course, any reasonable person would agree with this statement, but in principle, it is a matter 
of opinion. 
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I. P(A) is true; and 
2. for eachn EN, if P(n) is true, then so is P(n + 1). 


Then P(n) is true for every positive integer n. 


Proof. Suppose the theorem is false, and let S be the set of all positive integers n 
such that P(n) is false. Then S is a nonempty subset of N. By the Well Ordering 
Axiom, S has a smallest element k. Now, we are assuming that P(1) is true, sok > 1. 
Then k — 1 ¢ S,andhence P(k — 1) is true. By our assumption, P (k) is true as well, 
giving us a contradiction and completing the proof. 


Induction is a powerful tool! We can prove infinitely many propositions in just 
two steps. Here is a simple example. 


Example 2.1. We claim that for every positive integer n, we have 


n(n + 1)(2n + 1) 


P424...472 = ; 


We proceed by induction. For each n € N, the proposition P(n) is the statement 


1) 1 
PEP pp yt a MOTOR) 


First, we must prove P(1). But it states that 


_ 1d +H2Q-14+1) 


12 
6 ’ 


which is obvious. Now, we assume P() and prove P(n + 1). But 


n(n + 1)(2n + 1) 


7 + (n+ 1)’, 


P4274..-4n7 4+ (041% = 


by our inductive hypothesis, P(n). Simplifying, we have 


i See ee Se 

_ (n+ 1I)Qn? + In +6) 

_ mt 1G 4. 2)(0n +3) 

_ mt Na+ D+ D2 + 1) + wy 
6 


But this is precisely P(n + 1). Thus, the proof is complete. 
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There is another result that we can prove by induction, and which we will 
need later. A bit of notation is required. For any positive integer n, we define n! 
(read “n factorial’) via n! = n(n — 1)(n — 2)--- (2)(1). Also, 0! = 1. If and k are 
integers, with n > k > O, then we define (7) (read “‘n choose k’’) via (7) = raare 


Example 2.2. We have 5! = 5-4-3-2-1=120and (6) = &, = 2 = 15. 


Theorem 2.2 (Binomial Theorem). Let a and b be real numbers and n a posi- 
tive integer. Then 


(a+ 5" =a" + (TVarto (Sa 202 +--+ ( n | )ar oe 
n— 


Proof. Let us proceed by induction on n. When n = 1, both sides of the equation are 
a +b, so there is nothing to do. Assume the result for n, and prove it for n + 1. But 


(a+b)"*! = (a+b)"(a+b) 


n n n—-1 n n—1 n 
=la"+ 1 a’ b+eeet+ 1 ab" +b" ) (a+b), 
n— 


by our inductive hypothesis. 

When we expand this product, we obtain a sum of terms consisting of a coefficient 
multiplied by a”*+!~*b*, where 0 < k < n + 1. The coefficients of a”+! and b"*! are 
clearly 1, whereas if 0 < k < n+ 1, then the coefficient of a”+!~*b* is () + (as 
since these terms arise from ((7)a”-*b‘)a and ((,",)a"~*—Db'—!)b. However, 


n n 7 n! n! 
(‘) . (," , ~ (n — k)!k! ” (n-—k+1)'(k—- 1)! 
_ ni(n—k+1)+nk 


~  n—k+ 1k! 


es) 


1 1 
(a+ ott mately ("F Jarret ("* ar oe 


That is, 


and the proof is complete. 


Sometimes, a slightly different form of induction is required. 


Theorem 2.3 (Strong Induction). Suppose that, for each positive integer n, we 
have a proposition P(n). Further suppose that 
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1. P(A) is true; and 
2. for each integer n > 1, if P(k) is true for every k <n, then P(n) is true. 


Then P(n) is true for every positive integer n. 


Proof. Suppose that the theorem is false, and let S be the set of positive integers n 
such that P(n) is false. Then S is a nonempty subset of N. By the Well Ordering 
Axiom, ithas asmallest element 7. As P(1) is true, 7 > 1. But then by the minimality 
of j, we see that P(k) is true whenever k < j. Thus, P(/) is true, giving us a 
contradiction. 


As before, we must prove the first proposition. But after that, instead of just 
assuming that the previous case is true, we assume that all prior cases are true. This 
can give us more to work with. 


Example 2.3. Define a sequence via a; = 1, az = 3, a3 = 7 and, for each n > 4, 
An = An—-1 + Qn-2 + Gy_3. We claim that a, < 2” for all n € N. We need strong 
induction here, because when we consider a, we require information not just about 
dy—1, but about the terms before it as well. When n = 1, there is nothing to do. 
Assume that n > | and that the claim is true for smaller values of n. If n = 2 or 3, 
again, the result is obvious, so assume that n > 4. Then ad, = ay—1 + Gn—2 + Gn-3 < 
gr! 4 2"-2 4 2-3, by our inductive hypothesis. However, 2”! + 2”-? + 2"-3 = 
7-2"-3 < 2”, We are done. 


Exercises 


2.1. Show that for every positive integer n, 


n(n + 1) 


ee ee = 
a ia 5 


2.2. Show that for every positive integer n, 


n(n + 1)(n 4+ 2) 


1-242-3+---ta(nt+l= 3 


2.3. Show that for every positive integer n, the following two identities hold. 


3) 0) C0) =2 
()-()+6)-)++er (= 


2.4. In the plane R?, let us draw n lines, no two of which are parallel and no three 
of which meet at a point. Into how many regions do they divide the plane? 
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2.5. Show that for all integers n > 2, we have 

1. +a)" > 1+ 2a, for all positive real numbers a; and 

2. wn <2-4+, 

2.6. Show that e) is less than 4"~! for all positive integers n > 5. 


2.7. We define the Fibonacci sequence via fj = f2 = 1, and ifn > 2, then f, = 
fn—1 + fn—2- Show that, for every positive integer n, f,, < (7/4)""!. 


2.8. With f, as in the preceding exercise, show that for every positive integer n, 


oxy -(58) 
meek 


2.9. A bar of chocolate is a rectangular array consisting of r rows and c columns 
of unit square chocolate pieces, with thin lines separating the rows and columns. A 
single action consists of taking one bar, and breaking it along a line separating two 
rows or two columns, producing two smaller bars. Show that it will take precisely 
rc — | such actions to turn the bar into rc square pieces. (This can be done using 
strong induction, or with no induction at all.) 


fn = 


2.10. Show that for every positive integer n, there exist a positive integer k, and 
integers a; € {0, 1}, such that n = ap + 2a, + 22ay + 23a, +--+ Dag. 


2.2 Divisibility 


The following theorem simply formalizes the usual division process in the integers. 


Theorem 2.4 (Division Algorithm). Let a,b € Z with b > 0. Then there exist 
unique integers q andr such thata = bq +r, withO <r <b. 


Proof. We will prove the existence of q and r first, and then worry about their 
uniqueness. Let S = {a — bt: t € Z,a — bt > O}. If 0 € S, then a — bg = 0 for 
some q € Z, and hence a = bq + 0, as desired. Therefore, we may assume that 
S CN. Weclaim that Sis nonempty. Lett = —|a|. Thena — bt = a+ |a|b. Ifa > 0, 
then a + |a|b => 0, since b > 0. Ifa < 0, then a + |a|b = a(1 — b). Buta < 0 and 
since b > 1, wehave | — b < 0. Thus, a(1 — b) => O. Either way, the claim is proved. 

In view of the Well Ordering Axiom, S has a least element, say r = a — bq. By 
definition, r > 0.Ifr > b, thenO <r—b <r, butalsor —b=a—bq—b=a 
b(q + 1), and therefore a — b(q + 1) is a smaller element of S than r, contradicting 
the choice of r. Thus, a = bq +r, withO <r <b. 

As to uniqueness, suppose that a = bq; +r; = bq2 + 1r2, with gj,r; € Z and 
0 <r; < b. Then b(q; — q2) = r2 — 7). In particular, b|g, — q2| = |ro —1r1|.Ifqi A 
q2, then blg, — q2| => b. But 0 < 171, r2 < b, so |r2 — r1| < b, which is impossible. 
Therefore, g; = q2. But then r; = r as well. 
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We call q andr in the preceding theorem the quotient and remainder respectively. 
Example 2.4. Using b = 5, we have 68 = 5(13) + 3 and —21 = 5(—5) +4. 
The case in which the remainder is 0 is of particular interest. 


Definition 2.1. Let a and b be integers. We say that a divides b (or b is a multiple 
of a) if there exists an integer c such that b = ac. In this case, we write a|b. 


Example 2.5. As 84 = 6(14) and 84 = —3(—28), we write 6|84 and —3|84. On the 
other hand, 10 { 84. 


Here are a few basic properties of divisibility, the proofs of which are left as 
Exercise 2.14. 


Lemma 2.1. Let a,b,c € Z. Then 


1. ifa|b and b\c, then a\c; 
2. ifa|b and b £0, thena < |b|; and 
3. ifa\b and a|c, then a\(bu + cv) for any u,v € Z. 


Definition 2.2. Let a and b be integers, not both 0. Then the greatest common 
divisor (or ged) of a and b, written (a, b), is the largest positive integer g such that 
gla and g|b. 


Example 2.6. We have (60, 170) = 10 and (42, —55) = 1. 


Note that the gcd must always exist. As | divides everything, a and b must have 
a common divisor. Also, by Lemma 2.1, if a 40, then (a, b) < |a|. Thus, only 
the numbers from | to |a| need to be considered. We specifically exclude the case 
a = b = 0, since everything divides 0. 

Let us mention a couple of easy facts about gcds. 


Lemma 2.2. Take any integers a and b witha 4 0. Then 
I. (a,b) = (—a, b); and 
2. (a,0) = |al. 


Proof. (1) Any divisor of a also divides —a, and vice versa. 
(2) Clearly |a| divides both a and 0, and Lemma 2.1 shows that no larger integer 
can do so. 


One particular case is important. 


Definition 2.3. Let a,b € Z, not both 0. Then we say that a and b are relatively 
prime if (a, b) = 1. 


Example 2.7. By Example 2.6, 60 and 170 are not relatively prime, but 42 and —55 
are. 
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Why is the gcd so significant? The following theorem gives us an idea. 


Theorem 2.5. Let a and b be integers, not both 0. Then there exist u,v € Z such 
that (a,b) = au + by. Furthermore, (a, b) is the smallest positive integer that can 
be written in this way. 


Proof. Let S = {ax + by: x, y € Z,ax + by > 0}. Clearly S C N. Without loss of 
generality, we may assume that a 4 0. Then a? + b(0) = a” € S, and hence S is not 
empty. By the Well Ordering Axiom, S has a least element, say g = au + bv. We 
claim that g = (a, b). This will complete the proof. 

Suppose that cla and c|b. By Lemma 2.1, c|g and, hence, c < g. It remains only 
to show that g divides both a and b. Using the division algorithm, write a = gq +r, 
where g and r are integers and 0 < r < g. Then 


r=a-—gq=a-—(au+bv)qg =a(l —uqg) + b(-vq). 


Thus, ifr > 0, thenr € S. Butr < g, contradicting the minimality of g. Therefore, 
r =O and gla. By the same argument, g|b. 


The following is an immediate consequence. 


Corollary 2.1. Let a,b € Z, not both 0. Then a and b are relatively prime if and 
only if there exist integers u and v such that au + bv = 1. 


We can now prove a couple of useful results for relatively prime numbers. 


Corollary 2.2. Let a,b,c € Z witha and b not both 0. If (a, b) = 1 and a|bc, then 
alc. 


Proof. By the preceding corollary, we may write au+bv=1, for some u, 
v € Z. Then acu + bev = c. But ala and albc hence, by Lemma 2.1, a|(acu + 
bev) =c. 


Corollary 2.3. Leta, b € Z, not both 0. Ifa and b are relatively prime, and for some 
integer n, we have a|n and b\n, then ab|n. 


Proof. See Exercise 2.18. 


Be careful not to apply the last two corollaries if a and b are not relatively prime! 
For instance, 6|4 - 3, but 6 { 4 and 6 { 3. Also, 4|12 and 6|12 but 24 ¢ 12. 

What we have not yet discussed is how to find (a, b) and the numbers u and v from 
Theorem 2.5. We could certainly list the common divisors of a and b and see which 
one is largest, but if the numbers are large, this would be rather time-consuming. It 
would also give us no insight into finding u and v. Happily, there is a better way. The 
following technique is attributed to the ancient Greek mathematician Euclid. 
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Theorem 2.6 (Euclidean Algorithm). Let a and b be integers, with b positive. If 
bla, then (a, b) = b. Otherwise, apply the division algorithm repeatedly. Let 


a=bq+n 
b=rnqot+r2 


MN =N93 +13 


Th-2 = Tk-19k + 1k 


Tel = Veep +9, 
where qi,r; € Z for alli andO < rp < rp_| < +++ <1, <b. Then (a, b) = rx. 


Proof. If bla, then b is a common divisor of a and b. In view of Lemma 2.1, it is 
the largest possible common divisor. Assume that b { a. Note that we will only apply 
the division algorithm finitely many times, as each r;+; < 7;, and all are positive. 
Suppose that cla and c|b. By Lemma 2.1, c|(a — bqi) = r1. Thus, every common 
divisor of a and b is also a common divisor of b and r;. But if d|b and d|r;, then 
d|(bq, +1) = a. That is, the common divisors of a and b are precisely the same as 
those of b and r;. In particular, (a, b) = (b, r,). But by exactly the same argument, 


(a, b) = (b,r1) = (11, 72) = (72,73) = +++ = (re, 0) = re, 


by Lemma 2.2. 


We do require b to be positive in the Euclidean algorithm, but we can use the fact 
that (a, b) = (—a, b) if neither a nor b is positive. 

In fact, the Euclidean algorithm is doubly useful, because if we start with the 
penultimate equation and work our way backwards, we can find integers uw and v 
such that (a, b) = au + bv. Indeed, we have 


(a, b) = rp = re-21) + re-1(— Gx), 


and so (a, b) is a multiple of r,_2 plus a multiple of r,_;. But then 


Pk-1 = Tk-3 — Tk-29k-1 
and substitution yields 
(a, b) = re—2(1) + (re-3 — Pe-24K-1) (— 9k) = Pe-2 + Gk-19k) + Pk-3(—Gk)- 


That is, (a, b) is a multiple of 7,_3 plus a multiple of 7;_2. Eventually, we will write 
it in the desired form. 
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Example 2.8. Let a = 45 and b = 33. Applying the Euclidean algorithm, we have 


45 = 33(1) + 12 


a3 DOy49 
{2= 9(1) 43 
9 = 3(3) +0. 


Thus, (a, b) = 3. Let us find u and v such that au + bv = 3. We have 


32 oe) 
= 12(1) + (33) + 12(-2))(-1) 
= 1s e551) 
= (45(1) + 33(—1))(3) + 33(—1) 
= 45(3) + 33(—4). 


That is, (a, b) = 3a — 4b. 
Exercises 


2.11. In each case, use the Euclidean algorithm to find (a, b). 


1. a=57,b=20 
2. a=117,b=51 


2.12. For each of the two parts of the preceding problem, find integers u and v such 
that (a, b) = au + by. 


2.13. Let a and b be integers such that a|b and b|a. Show that a € {b, —b}. 
2.14. Prove Lemma 2.1. 


2.15. Show that if a, b and c are positive integers, with (a, b) = 1, and cla, then 
(c,b) = 1. 


2.16. Show that n> — n is divisible by 5 for every positive integer n. 


2.17. Let a and n be positive integers. Show that there exists an integer u such that 
n|(au — 1) if and only if a and 7 are relatively prime. 


2.18. Let a,b € Z, not both 0. If a and D are relatively prime, and for some integer 
n, we have a|n and b|n, show that ab|n. 


2.19. Take f;, as in Exercise 2.7. Show that 3| f, if and only if 4|n. 


2.20. Take f, as in Exercise 2.7. Show that 4| f, if and only if 6|n. 
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2.3. Prime Factorization 


Prime numbers will have a special importance throughout the course. 


Definition 2.4. A natural number p > | is said to be prime if its only positive 
divisors are | and p. Otherwise, it is composite. 


Note that 1 is neither prime nor composite. 
Example 2.9. The first few primes are 2, 3,5, 7,11, 13,17,.... 


An equivalent way of defining a prime number is given in the following result due 
to Euclid. 


Theorem 2.7 (Euclid’s Lemma). Let p > | bea positive integer. Then the following 
are equivalent: 


1. pis prime; and 
2. ifaand b are integers such that p|ab, then p|a or p\b. 


Proof. Suppose that p is prime and p|ab. Now, (p,a)|p, so (p,a) = 1 or p. If 
(p, a) = p, then since (p,a)|a, we have p|a. Otherwise, by Corollary 2.1, there 
exist integers uw and v such that pu + av = 1. But then pbu + abv = b. Now, p|p 
and p|ab, so by Lemma 2.1, p|b. 

On the other hand, if p is composite, then let p = cd, where 1 < c,d < p. In this 
case, p|cd, but by Lemma 2.1, p { c and p { d. 


Corollary 2.4. Let p be a prime number and a,,..., 4) € Z. If p|ayao-++ dn, then 
P\ai, for some i. 


Proof. Exercise 2.24. 


In fact, every positive integer larger than | can be written as a product of primes, 
called its prime factorization. 


Theorem 2.8 (Fundamental Theorem of Arithmetic). [fa ¢ N anda > 1, then 
there exist primes P1,..., Pn (not necessarily distinct) such that a = p,p2--: Pn. 
Furthermore, this product is unique up to order. That is, if a = q\q2--- Gm, for some 
primes qi, then m = n and, after rearranging the primes, p; = q; for alli. 


Proof. Let us prove the existence of the prime factorization and then handle the 
uniqueness. We will prove the result by strong induction on a. We are excluding the 
case a = 1, so start with a = 2. There is nothing to do here, since 2 is prime. Thus, 
let a > 2 and assume that the theorem is true for smaller numbers. If a is prime, 
there is nothing to do. Otherwise, we can write a = bc, with 1 < b,c < a. But then 
by our inductive hypothesis, b and c are both products of primes, and hence a is a 
product of primes. 
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Now let us prove the uniqueness. Suppose that 


4=>= Pi-**Pn=41°*' Im; 


for some primes p; and g;. Without loss of generality, sayn < m. Now, p;|a. Thus, by 
Corollary 2.4, pi|qi, for some i. Rearranging the primes as needed, we may assume 
that pi|qi. But q; is prime, so p; = | or qy. As | is not prime, p; = q. Cancelling 
P, and q, from the two sides of our equation, we have 


P2°** Pn = 42°**|m- 


Now do the same for pz and repeat. We find that, after rearranging, p; = gj, 1 < 
i <n. Ifm =n, we are done. Otherwise, we are left with 1 = gy) +++ mn. But then 
dm|1, which is impossible, as g,, > 1. 


Example 2.10. We can write 1400 = 2-2-2-5.-5.-7, and there is no other way to 
write 1400 as a product of primes, except by rearranging (for instance, 2-5-7-2- 
2-5). 


Note that this gives us one good reason not to consider | as a prime: we would 
have to abandon uniqueness, as we could multiply by 1 as many times as we wanted. 
We can use the existence of prime factors to prove a handy fact. 


Corollary 2.5. Let a, b and n be integers, with n # 0. If (a,n) = (b,n) = 1, then 
(ab,n) = 1. 


Proof. If (ab, n) > 1, then by Theorem 2.8, there exists a prime p dividing (ab, n). 
Since p|ab, Theorem 2.7 tells us that p|a or p|b. But p|n as well; thus, (a, n) > p 
or (b,n) > p. Either way, we have a contradiction. 


Exercises 


2.21. Factor each of the following numbers into a product of primes: 3528, 30030 
and 220000. 


2.22. Show that for every prime p > 3, there exists a positive integer k such that 
p=6k+1 or p= 6k — 1. 


2.23. Let p be a prime and n an integer. Show that either p|n or (p,n) = 1. 
2.24. Use induction to prove Corollary 2.4. 

2.25. Let pi,..., px be any primes. Show that for each i, p; { (pip2-++ pe + 1). 
2.26. Use the preceding exercise to show that there are infinitely many primes. 
2.27. Let p be a prime and a,n € N. Suppose that p|a”. Show that p”|a”. 


2.28. Let p;,..., px be distinct primes, and let m;, n; be nonnegative integers. Find 
my Nk 


the gcd of p} ++ py and pj! - +> py". 
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2.4 Properties of the Integers 


This section may seem a tad underwhelming. Indeed, there are no proofs at all and 
we will not really learn any new facts about the integers. The whole point is to 
establish some terminology that we will see many times in different settings. While 
our discussion will take place in Z, it is worth noting that we could just as easily use 
Q, Ror C. 

First, we observe that Z is closed under addition and multiplication. That is, 


a+b,abeZ 


for alla, b € Z. 
Next, addition and multiplication on Z are both associative. This means that 


(a+b)+c=a+(b+c) and (ab)c = a(bc) 


for all a,b,c € Z. In particular, we can write a+ b+ c and abc without fear of 
ambiguity. 
Furthermore, addition and multiplication are both commutative on Z. In other 
words, 
a+b=b+aandab=ba 


for alla, b € Z. 
We also have the distributive law. Specifically, 


a(b+c) =ab+ac 


for alla, b,c € Z. 
The numbers 0 and 1 are rather special. We call 0 the additive identity for Z and 
1 the multiplicative identity. This is because 


a+0O=aanda:-l=a 


for alla € Z. 
Finally, if a € Z, then —a is its additive inverse. This means that 


a+(-a) = 0. 


It is important to note that we do not have multiplicative inverses for all integers; 

that is, if a € Z, it does not follow that there exists a b € Z such that ab = 1. In fact, 

this only happens if a is 1 or —1. (The sets Q, R and C are a bit different on this 

last point. Every element other than 0 has a multiplicative inverse in these sets. For 
9 


instance, in Q, the multiplicative inverse of 5 is 5.) 
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Exercises 


2.29. For each of the following binary operations on Z, decide if it is commutative; 
that is, do we havea *b = b xa foralla,b € Z? 


l. axb=ab+1 
2. a*x*b=a+b+ab 
3. axb=a 


2.30. For each part of the preceding exercise, are the operations associative? That 
is, do we have (a x b) *c = ax (b *c) foralla, b,c € Z? 


2.31. For parts (1) and (2) from Exercise 2.29, decide if * has an identity; that is, 
does there exist an e € Z such that a xe =e xa=a foralla € Z? 


2.32. Define a binary operation * on Q via a * b = a+b —ab. Find an identity 
e; that is, find e € Q such that a * e =e * a =a for all a € Q. Then decide which 
elements of Q have inverses. That is, determine for which b € Q there exists ac € Q 
such thatb*c=cxb=e. 


2.5 Modular Arithmetic 


When we perform modular arithmetic, we choose an integer n > 2 and then for any 
integer a, we concern ourselves only with the remainder when a is divided by n. As 
the only possible remainders are 0, 1, 2,..., — 1, these are the only numbers to 
worry about. 


Definition 2.5. Let n > 2 be an integer. Ifa, b € Z, then we say that a is congruent 
to b modulo n, and write a = b (mod n), if n|(a — b); that is, if a and b have the 
same remainder when divided by n. 


Example 2.11. As8|(53 — 21),wehave53 = 21 (mod 8). Putting this another way, 
53 and 21 both have remainder 5 when divided by 8. We reduce to the remainder and 
write 53 = 5 (mod 8) and 21 = 5 (mod 8). 


We add and multiply modulo n in the usual way, simply reducing to the remainder. 
Example 2.12. We observe that 
5+8=1 (mod 12) and5-8=4 (mod 12). 


Of course, we should be a bit careful here. For instance, since 5 = 17 (mod 12), 
we had better make sure that 5+ 8 = 17+ 8 (mod 12). This is certainly the case, 
but it will help if we express things in terms of equivalence classes. 
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Theorem 2.9. Letn > 2 be an integer. Thena = b (mod n) is an equivalence rela- 
tion on Z. The equivalence class of a consists of all integers having the same remain- 
der as a when divided by n. 


Proof. Reflexivity: We have n|O = a—a, so a=a (mod n). Symmetry: Sup- 
pose that a = b (mod n). Then n|(a — b), and hence n| — (a — b) = b — a. Thus, 
b =a (mod n). Transitivity: Suppose that a = b (mod n) and b=c (mod n). 
Then n|(a — b) andn|(b — c). Hence, n|((a — b) + (b — c)) =a — cc. Thatis,a =c 
(mod n). The statement about the equivalence classes follows from the definition. 


Definition 2.6. Let n > 2 be an integer. The set of integers modulo n, denoted Z,, 
is the set of all equivalence classes of Z with respect to the equivalence relation 
a =b (mod n). We call these the congruence classes modulo n. Specifically, Z, = 


{[0], [1], [2], ..., [a — 1]}. 
Example 2.13. The elements of Z4 are [0], [1], [2] and [3], where 


[0] = {..., -, 8, —4, 0, 4, 8,...} 
[1] ={...,-7, -3, 1,5,9,...} 
[2] = {..., -6, —2, 2, 6, 10, ...} 
[3] = { 


As usual, in dealing with equivalence classes, the choice of the representative of 
the class is not unique. For instance, in the above example, we could just as easily 
have written [—5] or [7] instead of [3]. It is, however, customary to reduce final 
answers in Z, to the form [a], where 0 < a <n. 

We can now define addition and multiplication on Z,,. These work in the obvious 
way. Specifically, 


Example 2.14. In Z7, we have [5] + [2] = [7] = [0] and [5][3] = [15] = [1]. 


Theorem 2.10. For any integer n > 2, addition and multiplication on Z, are well- 
defined. 


Proof. Suppose that a; = a) (mod n) and bj = by (mod n). Then 
(a, + bi) — (a2 + bz) = (ay — a2) + (D1 — 2). 


Since n|(a, — az) and n|(b; — bz), we see that n|((a, + D1) — (a2 + b2)). That is, 
[a, + b;] = [az + bo], so addition is well-defined. Also, 


a,b, — aby = (a,b, — aybz) + (aybz — azb2) = ay (by — bz) + (a1 — a2)bp. 


Since n|(a; — az) and n|(b; — bz), we find that n|(a,;b; — azbz). That is, [a;b;] = 
[a2b>2], and multiplication is well-defined. 
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Let us discuss a few properties of addition and multiplication in Z,. These should 
be compared with the properties of Z mentioned in Section 2.4. 


Theorem 2.11. Letn > 2 be an integer, and take any [a], [b], [c] € Z,. Then 


1. [a] + [b] € Z, (closure under addition) 

2. [a] + ([b] + [c]) = ((a] + [b]) + [c] (associativity); 
3. [a] + [b] = [b] + [a] (commutativity); 

4. [a] + [0] = [a] (additive identity); and 

5. [a] + [—a] = [0] (additive inverse). 


Proof. (1) is clear from the definition. The other parts all work because they work 
in Z. For instance, [a] + [b] = [a+ b] = [b+ a] = [b] + [a], proving (3). The 
remaining parts are left as Exercise 2.35. 


And now, some properties of multiplication. 


Theorem 2.12. Letn > 2 be an integer, and [a], [b], [c] € Zy,. Then 


b] € Z, (closure under multiplication); 


1. [all 

2. [al (ble) = (lal[b) [ce] (associativity); 

3. [a][b] = [b]la] (commutativity); 

4. [a]({b] + [c]) = [a@][b] + [alle] (distributive law); and 
5. [a]{1] = [a] (multiplicative identity). 


Proof. (1) follows from the definition, and the other parts are true because they are 
true in Z. For instance, 


[a]([b] + [e]) = la]lb +c] = [ab +c)] 
= [ab + ac] = [ab] + [ac] = [a][] + [alle], 


proving (4). The rest is left as Exercise 2.36. 


As in Z, we do not necessarily have multiplicative inverses. For instance, in Z4, 
we find that [5][3] = [1], but there is no integer a such that [6][a] = [1]. However, 
Zs behaves more like Q; indeed, if [a] 4 [0], then there exists a [b] € Zs such that 
[a][b] = [1]. More on this later! 

It is worth mentioning that Z, does not behave exactly like Z. For instance, in 
Z, we are used to the fact that if ab = 0, then a = 0 or b = O. But in Z)2, we have 
[4][9] = [0]. We are also accustomed to cancellation in Z; that is, if ab = ac, and 
a # 0, then b = c. Not necessarily true in Z,! For example, in Z17, we have [2][3] = 
[2][9], but [2] ~ [0] and [3] 4 [9]. So we must be careful with our assumptions. 

And now, having acquainted ourselves with Z,, we are going to make a change 
in notation. It is rather cumbersome to have to write [a] or [a] + [b] all the time. 
Therefore, when working in Z,,, we will normally simply write a or a + b, as long as 
the context is clear. We will include the equivalence class brackets if they are needed 
for greater clarity. 


Example 2.15. When working in Z9, we simply write 3 + 8 = 1 and3-8 =4 
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We need to prove one last property of modular arithmetic, which dates back to 
ancient China. 


Theorem 2.13 (Chinese Remainder Theorem). Let n,,..., 4 be positive inte- 
gers, all larger than 1, such that (n;,n;) = 1 whenever i # j. If a,...,q €Z, 
then there exists an integer b such that b = a; (mod nj) for all i. Furthermore, if 
c =a; (mod n;) for alli, thenb =c (mod njn2--- nx). 


Proof: For each i, let d; be the product of all of the n; except for n;; that is, dj = 
mnewwk Since (nj,nj) = 1 wheni ¥ j, Corollary 2.5 shows us that (n;, d;) = 1. By 
Corollary 2.1, there exist integers u; and v; such that n;u; + djv; = 1. Thus, d;v; = 1 
(mod n;). Let 

b = div + dyvad2 + +++ + dyvgag. 


Then since n;|d; ifi 4 j, we have 
b= djVidj = dj (mod nj), 
for all i, as required. 


Finally, if c= a; (mod n;) for alli as well, then b= c (mod n;) for all 7; that 
is, n;|(b — c) for all i. By Corollary 2.3, njnz2---ng\(b —c). 


Example 2.16. Let us solve the congruences b = 3 (mod 5), b =4 (mod 11) and 
b = 6 (mod 14). We have d; = 154, dz = 70 and d3 = 55. Solving 5u; + 154; = 
1 using the Euclidean algorithm, we get uw; = 31, vy) = —1. When we solve I luz + 
70v2 = 1, we get ur = —19, v2 = 3. Finally, a solution to 14u3 + 55v3 = Lis u3 = 
4, v3 = —1. Therefore, b = 154(—1)(3) + 70(3)(4) + 55(—1)(6) = 48. Thus, the 
solution is b = 48 (mod 770). 


Exercises 


2.33. Perform each calculation in Z7. The final answer should be a nonnegative 
integer no larger than 6. 


1.2-3-4 
2. (4-5)? 


2.34. Perform each calculation in Z\5. The final answer should be a nonnegative 
integer no larger than 14. 


1. 5-11-—3-4 
2. 2 


2.35. Complete the proof of Theorem 2.11. 
2.36. Complete the proof of Theorem 2.12. 


2.37. For each nonzero element a € Zy9, decide if there is a nonzero b € Zo9 such 
that ab = 0 in Zo. If so, provide such an element b. 
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2.38. For each element a € Zy0, decide if there exists a b € Zoo such that ab = | 
in Zoo. If so, provide such an element b. 


2.39. Show that if p is prime, then there are at most two elements a € Z, such that 
a=1inZ p- Find an example of a composite p where there are more than two 
solutions. 


2.40. Let a and b be integers. Show that if a = b (mod p) for every prime p, then 
a=b. 


2.41. Find a € Z such that a = 2 (mod 3), a =4 (mod 7) and a = 3 (mod 10) 
simultaneously. 


2.42. Finda € Z such that a = 3 (mod 8),a =4 (mod 11) anda =7 (mod 15) 
simultaneously. 


Part II 
Groups 


Chapter 3 ®) 
Introduction to Groups ra 


We now begin our study of abstract algebra in earnest! A group is one of the simplest 
algebraic structures; we take a set, assign an operation to it, impose four basic rules, 
and see what we can deduce. And yet, the possibilities are endless. Groups show 
up everywhere, and not just in mathematics. Indeed, it would be difficult to study 
physics or chemistry without an understanding of group theory. The solution to the 
famous Rubik’s cube is also a problem in groups. 

In this chapter, we will define the notion of a group, and give a number of examples. 
We will also prove several basic properties of groups and subgroups. 


3.1 An Important Example 


In the next section, we will give the definition of a group. For now, we will look at a 
motivating example. 

Let A be the set {1, 2, 3}. We would like to consider all of the permutations of 
A; that is, all the ways of rearranging the numbers 1, 2 and 3. For example, we have 
the permutation o, where o (1) = 2, 0(2) = 1 anda (3) = 3. We can easily see that 
there are going to be exactly 6 such permutations, as there are 3 choices for o (1), 
then 2 remaining choices for 0 (2), and once those are known, o (3) is determined. 

A bit of notation would be helpful. Let us denote a permutation o by writing two 
rows. The elements of A go in the first row, and the numbers to which each of them 
is sent in the second; that is, we take 


— 123 
~\abe 
to mean that o (1) = a,o(2) = bando (3) = c. Then the permutation we mentioned 
above would be denoted 
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123 
213)° 


In fact, the complete list of permutations is 


123 123 123 123 123 ve 123 
123)’ \132)? \213/)’ \23 1)’ \312 321)})° 


Let us now discuss the composition of two permutations. For instance, if 


= f123\ 4... (123 
= 5-44 > Vago) 3 


then we see that (0 0 T)(1) = o(t(1)) = 0 (3) = 3, (06 0 T)(2) =o (T(2)) =o (1) = 
2 and (o 0 T)(3) = a (t(3)) = 0 (2) = 1. Thus, 


_ (123 
ooT= 321)° 


(It is worth noting here that we apply Tt first, then o.) 

We can now consider some properties that these permutations enjoy with respect to 
this composition operation. As we discuss them, please compare with the properties 
of Z or Z,, under addition, with which we are already familiar. 

First of all, we have closure. That is, if we take two permutations of A and compose 
them, we obtain another permutation of A. In fact, we proved this in Theorem 1.2, 
where we saw that the composition of two bijections is a bijection. 

Next, we have associativity; that is, for any permutations p, o and T, we have 
po(o0T)=(poa) ot. We have seen this before as well; by Theorem 1.2, the 
composition of functions is always associative. 

Also, we have an identity. In particular, if o is any permutation of A, then 


123\ (123 _ 
PORTIS Via aye es 
Composing with the permutation that moves nothing cannot change a function. 
Finally, we have inverses; that is, for each permutation o, there is another per- 


mutation t such that 
pe _ {123 
OoT=TOG= 123)? 


the identity. This is easy enough to calculate directly; for instance, 


Ngee 123\_ (123 123\_ (123 
P30) Vio) > 1a)” Wes 1) — 1194) * 
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However, the existence of such an inverse is guaranteed by Theorem 1.3. 

Given our discussion in Sections 2.4 and 2.5, we can agree that all of these 
properties are shared by Z and Z,, under addition. However, we also noted that the 
addition operation is commutative. Not so here! For instance, 


i324 123). 7128 
31) \t3o)- \o13)% 
Los 12a) faa 
130) "193 i) = (aor). 


Thus, in general,o ot ATOO. 

These permutations under the composition operation give us a nice example of a 
group, as we shall see momentarily. There is, of course, nothing very magical about 
the set A = {1, 2, 3} here. Indeed, we could just as easily have used {1, 2,3, ..., n}, 


for any positive integer n. The set of all permutations of this set, under the composition 
operation, is called the symmetric group on 7 letters, and is denoted S,,. 


whereas 


Exercises 


1234 1234 . 
3.1. In S4, leto = t ca ») and t = is 4] TF Calculate the following. 


l. oT 
2. TO 
3. the inverse of o 


12345 12345 : 
3.2. In $5, leto = (5 321 ;) and t € 413 :): Calculate the following. 
1. ota 
2. OOT 


3. the inverse of o 


3.3. How many permutations are there in S,? In Ss, how many permutations a 
satisfy a(2) = 2? 


3.4. Let H be the set of all permutations a € Ss satisfying a(2) = 2. Which of the 
properties we have discussed (closure, associativity, identity, inverses) does H enjoy 
under composition of functions? 


3.5. Consider the set of all functions from {1, 2,3, 4,5} to {1, 2,3, 4,5}. Which 
of the properties (closure, associativity, identity, inverses) does this set enjoy under 
composition of functions? 


3.6. Let G be the set of all permutations of N. Which of the properties (closure, 
associativity, identity, inverses) does G enjoy under composition of functions? 
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3.2 Groups 


We can now give the general definition of a group. 


Definition 3.1. A group is a set G, together with a binary operation *, satisfying 
the following conditions: 


1. axb eG forall a,b € G (closure); 

2. (ax b)*c =a x (bx c) forall a, b,c € G (associativity); 

3. there exists an e € G such that axe =exa=a for all a € G (existence of 
identity); and 

4. for each a € G, there exists a b € G such thataxb=bxa=e (existence of 
inverses). 


We will refer to G as a group under «. 

The element e is called the identity of the group. Ifa € G,andaxb=bxa=e, 
then b is called the inverse of a, and we write b = a~!. 

As we discussed in the previous section, the group operation does not have to be 
commutative. We have a special term for groups that do have this property, named 
after mathematician Niels H. Abel. 


Definition 3.2. A group G is said to be abelian if a * b = b «a for alla, be G. 


We devote the remainder of this section to examples of groups. 


Example 3.1. As we saw in Sections 2.4 and 2.5, Z and Z,, (for any integer n > 2) 
are abelian groups under addition. Indeed, 0 is the identity, and the inverse of a is 
—a. In fact, the same can be said for Q, R and C under addition. 


When a group G has only finitely many elements, we can represent it with a group 
table. We write the elements of G down the first column and along the first row of 
the table. Then the entry in the row headed by a and the column headed by b is a x b. 
For instance, the group table for Zs is given in Table 3.1. 


Table 3.1 Group table for the additive group Zs 


Example 3.2. Let G be the set of nonzero rational numbers. Then G is an abelian 
group under multiplication. Indeed, we see that the product of two nonzero rationals is 
a nonzero rational, hence closure is satisfied. Also, multiplication of rationals is both 
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associative and commutative. Clearly, a - 1 =a, for all a € G, so 1 is the identity. 
Finally, if a = m/n € G, with m and n nonzero integers, thena~' = n/m € G, since 


(m/n)(n/m) = 1. 


This last example merits a second look. In particular, it is worth noting that we 
cannot do the same thing with the set of nonzero integers. To be sure, the product of 
two nonzero integers is a nonzero integer, and the multiplication is associative. Also, 
1 is the identity. But 2 has no inverse; that is, there is no integer a such that 2a = 1. 
In fact, the only integers that would have inverses in this set are 1 and —1. The set 
{1, —1} is easily seen to be a group under multiplication. 

Let us see how the integers modulo n compare. 


Example 3.3. Letn > 2 bea positive integer. Let U (n) denote the set of all elements 
a € Z, such that (a, n) = 1. (For instance, U(10) = {1, 3, 7, 9}.) Let us ensure that 
this makes sense. That is, if a = b (mod n), and (a, n) = 1, then it had also better 
be the case that (b,n) = 1. Buta = b +-nk, for some integer k. Then, if c divides 
both b and n, then c divides a as well. We claim that U (1) is an abelian group under 
the multiplication operation in Z,. First, closure. By Corollary 2.5, if (a,n) = 1 and 
(b,n) = 1, then (ab, n) = 1. We also know that multiplication in Z,, is associative 
and commutative, and | (which obviously lies in U(n)) is the identity. What about 
inverses? If a € U(n), then since (a, n) = 1, there exist integers u and v such that 
au +nv = 1. That is, in Z,, au = ua = 1. The group table of U(10) is given in 
Table 3.2. 


Table 3.2 Group table for the multiplicative group U (10) 
1379 


9/9731 


Note that we will use the notation U(n) from the above example throughout the 
book. 


Example 3.4. Letn be a positive integer, and let G be the set of all complex numbers 
w Satisfying w” = 1. Then we claim that G is an abelian group under multiplication. 
Of course, we know that the multiplication is both associative and commutative, and 
1 € G will serve as a multiplicative identity. If v, w € G, then (vw)” = v"w” = 1, 
so vw € G, and we have closure. Also, if w € G, then we know that 1/w € C. But 
(1/w)” = 1/(w") = 1, and therefore 1/w € G. 


In particular, if 7 = 4 in the above example, then we get the group {1, —1, i, —i}. 
Also, if m = 1, then we just get the group consisting of the identity. This is known 
as the trivial group. 

Of course, not all groups are abelian. Two useful examples follow. 
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Example 3.5. As we illustrated in the previous section, S, is a group under compo- 
sition. If n > 3, then the group is nonabelian. 


Example 3.6. The set of all invertible 2 x 2 matrices with entries in R is called 
the general linear group over R, and denoted GL>(R). It is a group under matrix 
multiplication. The identity matrix J, is the identity of GL2(R). Also, if A, B € 
GL>2(R), then 


AB(B7'!A7!) = A(BB"!)A7! = ALA! = AAST=Hh 
and, similarly, (B-!A-!)AB = 1). Thus, AB is invertible as well, so GL2(R) is 
closed under multiplication. Also, matrix multiplication is associative. By definition 


of GL2(R), every element A has an inverse, and since (A~!)~! = A, we know that 
A~! € GL2(R). Thus, GL2(R) is indeed a group. However, the group is nonabelian. 


For instance, 
11 10\ /21 
01 11) \11/)’ 
10\/11)\_ /11 
11) \01)/° \12)° 


By changing the entries in the matrices, we can obtain other general linear groups, 
such as GL2(Q). We can also use invertible n x n matrices and obtain GL, (IR). 


whereas 


Let us also present a useful way of obtaining new groups from old ones. 


Definition 3.3. Let G be a group with operation « and H a group with operation e. 
On the Cartesian product G x H, define an operation © via 


(g1, 41) © (g2, Az) = (81 * 82, h1 @ hz), 


for all g; € G, h; € H. Under this operation, we call G x H the direct product of 
G and H. 


Theorem 3.1. The direct product of two groups is a group. 


Proof. Let us adopt the same notation as in the definition. First, we must check that 
the direct product is closed. Butif g1, g2 € G,h1, hz € H, then (g1, h1) > (g2, ho) = 
(g1 * 22,h, eh2) € G x H, since g; * g2 € G, hy * hz € H. The associativity of 
follows from the associativity of « and e. Indeed, if g;, g2, g3 € G, hy, ho, h3 € H, 
then 
((g1, 41) © (82, A2)) > (93, 43) = (81 * 82, hi @ hz) > (g3, hs) 

= ((g1 * 82) * 83, (41 @ ha) eh) 

= (81 * (82 * 83), 1 @ (ha @h3)) 

= (81,1) © ((g2, ha) > (93, A3)). 
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Let eg and ey be the identities of G and H respectively. Then for any g € G, 
h € H, we have 


(g,h) > (eg, en) = (8 * eG, heey) = (g,h) 


and, similarly, (eg, ey) > (g, h) = (g, h). Thus, (eg, ey) is the identity for G x H. 
Furthermore, (g,h) o (g7!,h7!) = (gx g7!,heh7') = (eg, ex) and, similarly, 
(g-!,h7!) 0 (g,h) = (eg, ey). Thus, (g,h)~'=(g7!,h7!). The proof is 
complete. 


Example 3.7. Suppose that G = Z; and H = S;. TheninG x H, 


(+(251)) © G21)) = (+49 G51) G23) = @ 033) 


Before concluding this section, it seems to be worth mentioning that part of the 
definition of a group is redundant. We specify that * is a binary operation on G, 
and then require closure. But closure is part of the definition of a binary operation! 
Nevertheless, it is a good idea to emphasize this point, as closure must be checked 
whenever a new group is defined, and it is easy to forget about it if it is buried inside 
another definition. 


Exercises 


3.7. Give group tables for the following additive groups. 


1. Zs 
2s Z3 x Zo 


3.8. Give group tables for the following groups. 


1. U(12) 
2; aS: 


3.9. Show that G x H is abelian if and only if G and H are abelian. 
3.10. Let G be a group containing at most three elements. Show that G is abelian. 


3.11. Explain why neither of the following is a group. 


1. the set of positive rational numbers under division 
2. the set of rational numbers g > 1 under multiplication 


3.12. Is either of the following a group under addition? 


1. the set of even integers 
2. the set of odd integers 


3.13. Let G = {a+ bi € C: a* +b? = 1}. Is Ga group under multiplication? 
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3.14. Let G be the following subset of Z)5, namely {3, 6, 9, 12}. Show that G is a 
group under multiplication in Z,5. Find the identity, and the inverse of each group 
element. 


3.15. Let p beaprime and G = {a/p” : a € Z,n € N}. Is Gagroup under addition? 


lab 

3.16. Let G be the set of all matrices of the form | 0 1 c |, witha, b,c € Z. Show 
001 

that G is a group under matrix multiplication. Is it abelian? 


3.3. A Few Basic Properties 


Let us begin with a small notational change. Usually, when we are working inside a 
group, we suppress the symbol for the group operation. That is, we write ab instead 
of a * b. The major exception is where the operation is addition, in which case 
this multiplicative notation would be confusing. In that case, we will use additive 
notation and continue to write a + b instead of ab, 0 instead of e and —a instead 
ofa7!. 

In the preceding section, we glossed over the uniqueness of the group identity and 
inverses of group elements. These are important points, if we are to speak of “the” 
identity, or write a~' and have it mean something. Let us take care of this problem. 


Theorem 3.2. Let G be a group. Then 


1. the identity of G is unique; and 
2. ifa € G, thena™ is unique. 


Proof. (1) Suppose that e and f are both identities in G. Then as f is an identity, 
ef =e. But as e is an identity, we also have ef = f. Therefore, e = f. 

(2) Suppose that b and c are both inverses of a. Then as b is an inverse of a, 
(ba)c = ec = c. However, as c is an inverse of a, we have b(ac) = be = b. Given 
that our group operation is associative, b = b(ac) = (ba)c =. 


We know that in any group, (ab)c = a(bc). Thus, we can write abc without 
worrying about ambiguity. But we would like to be able to write abcd, for instance. 
To that end, we have the following result. 


Theorem 3.3. Let G be any group, and a), a2, ..., An € G. Then regardless of how 
the product a, a2 - ++ Ay is bracketed, the result equals (- - - (((a1a2)a3)a4) ++ + An—1) An. 


Proof. Our proof is by strong induction upon n. If7n is 1 or 2, no bracketing is needed, 
so there is nothing to do. When n = 3, this is the associativity from the definition of 
a group. Therefore, let m > 4, and suppose that the theorem is true for any product of 
fewer than n group elements. Take any bracketing of w = a, ---a,, and look at the 
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last operation to be performed. Then w = xy, where x is the product a; --- a, and y 
is the product d@y41 ---d,, each with some bracketing. By our inductive hypothesis, 
X = (++ (a1 2)a3) +++ Gn—1) Am and y =6++ (Gn +14m-4 2)4m t 3) °° Ay—1) an lfm = 
n — 1, then writing xy in this way, we have our desired conclusion. If not, then by 
associativity, 


sy = ((- oe ((a1az)a3) an An)(- ve ((dm +14m4 2)Am } 3) en 7) 1))n. 


Now applying our inductive hypothesis to the product of the first n — 1 terms, we 
obtain the desired bracketing. 


Therefore, we do not have to use brackets when we write a product of group 
elements. However, we must always remember that unless our group is abelian, we 
cannot rearrange terms at will. For instance, (ab)(cd) = (a(bc))d, and we can write 
both as abcd, but we cannot write abcd = cdba. 

Let us also prove a couple of useful facts about inverses. 


Theorem 3.4. Let G be a group, with a,b € G. Then 


1. (a“!)“! =a; and 
2. (ab)! =b"'a“. 


Proof. (1) Since aa~' = a~'a =e, we see from the definition of inverses that the 
inverse of a~! is a. 

(2) Notice that (ab)(b~!a~!) = a(bb“!)a~! = aea~! = aa“! =e and, simi- 
larly, (b~'!a7!)(ab) = e. Therefore, b~'a7! is the inverse of ab. 


Do not make the mistake of thinking that the inverse of ab is a~'b7!! 
In ordinary arithmetic with real numbers, we know that if ab = ac, anda £ 0, 
then b = c. We have something similar for groups. 


Theorem 3.5. (Cancellation Law). Let G be a group and a,b,c € G. If either 
ab = ac or ba = ca, thenb=c. 


Proof. If ab = ac, thena~!ab = a~'ac. As a~'a = e, we have eb = ec, and there- 
fore b = c. The proof is similar if ba = ca. 


When our group has finitely many elements, the cancellation law has important 
implications for the group table. Suppose that, in the row headed by a, the group 
element b occurs twice. Then there exist group elements c and d such thatac = b = 
ad. But then we know that c = d. Therefore, a group element can occur only once 
in each row. In the same way, there will be no repetitions in any column. 


Corollary 3.1. Let G be a group and a, b € G. Then there exists exactly one c € G 
such that ac = b, and exactly one d € G such that da = b. 


Proof. We showed the uniqueness of c and d above. To show the existence of c and 
d, let c=a~'b and d = ba™!. Then ac = aa~'b = eb = b, and da = ba“!a = 
be=b. 
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Example 3.8. Suppose G is a group with four elements, a, b, c and d. If we are 
given the partial group table shown in Table 3.3, we can fill in the missing elements. 
Indeed, examining the first row, we see that ad cannot be b or d. The last column 
tells us that it also cannot be a, so ad = c. As there must be an a in the first row, 
ab = a. Filling in the rest of the table is left as an exercise. 


Table 3.3 Incomplete group table 


Exercises 


3.17. Simplify each of the following expressions as far as possible in an arbitrary 
group G, leaving no brackets. 


1. (acb)(cbab)~! 
2. (a~'bea)~! 


3.18. Repeat the preceding exercise, assuming that G is abelian. 
3.19. Fill in the rest of Table 3.3. 


3.20. Let G = {v, w, x, y, z} be a group with five elements. Further suppose that 
vw = y,vy =v, wx = Zz, xv = w and zw = v. Fill in the group table for G. 


3.21. Show that the following are equivalent for a group G: 


1. for every a, b,c € G satisfying ab = ca, we have b = c; and 
2. G is abelian. 


3.22. Suppose that in the definition of a group, we replace the third part with the 
following weaker condition: 

(3’) There exists an e € G such that for every a € G, ae =a. 

(That is, we do not insist that ea = a.) Show that we still get a group. 


3.4 Powers and Orders 


In group theory, the word order is used in two different, but related, ways. One is 
easy. 
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Definition 3.4. If G is a group, then its order, |G], is the number of elements in the 
set G. We say that G is a finite group if its order is finite; otherwise, it is an infinite 
group. 


Example 3.9. If G = Zs, then |G| = 5, and therefore G is a finite group. On the 
other hand, Z is an infinite group. 


To understand the other use of the word, we need to know about powers of group 
elements. Let G be any group, and a € G. Then for any positive integer n, we let 


a" =aa---a. 
—— 
n times 


(Alternatively, we could define the powers recursively. That is, let a! = a, and for 
each positive integer n, leta”t! = a"a.) Also, leta® = e and, for each positive integer 
n, leta~” = (a”)7!. 


Example 3.10. In U(20), we calculate P=7-7-7=9-7=3. If we wanted to 
know 7~?, then we would calculate (77)~! = 3-! = 7, since 3-7 = 1. 


Powers behave in a rather nice manner, as the following theorem tells us. 


Theorem 3.6. Let G be a group, witha € G, and let m and n be any integers. Then 


1. a™q” = qutn. 
2. (a”)" = a™"; and, in particular, 
3. a" =(a7!)". 


Proof. Exercise 3.26. 


We know that if the group operation is addition, then we will use additive nota- 
tion, rather than multiplicative. In this case, our exponentiation notation would be 
confusing, so we will write things in a more familiar manner. Instead of a”, we will 
write na (that is, add a to itself n times). 


Example 3.11. In Z,2, since our operation is addition, instead of writing 5+, we 
would write 4-5 =5+4+5+5+4+5=8. 


Sometimes, a group will consist only of powers of a specific group element. 


Definition 3.5. A group G is said to be cyclic if there exists an element a such that 
every element of G is a power of a. In particular, we say that G is generated by a, 
and write G = (a). 


Example 3.12. The additive group Z is cyclic; indeed, Z = (1). (Remember, in an 
additive group, the powers are integer multiples, so ifa € Z, thena =a - 1.) In fact, 
Z = (—1) as well, so the generator of the cyclic group is not unique. In the same 
way, Z, is cyclic. 
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Example 3.13. Consider the multiplicative group of complex fourth roots of unity 
discussed in Example 3.4, namely G = {1, —1,i, —i}. Then G is cyclic. Indeed, 
G = (i) since the powers of i are i, —1, —i and 1. 


Not every group is cyclic. For one thing, we have the following fact. 
Theorem 3.7. Every cyclic group is abelian. 


Proof. Let G = (a). If b,c € G, thenb = a” andc = a", for some m,n € Z. Then 
be = aa" = a™*", but ch = a"a™ = at" as well. 


However, abelian groups need not be cyclic. 


Example 3.14. The group U (10) is cyclic, but U (8) is not. To see this, observe that 
U(10) = {1, 3, 7, 9}. But the powers of 3 are 3, 9, 7 and 1, so U(10) = (3). On the 
other hand, U(8) = {1, 3, 5, 7}. But the powers of 1 are all 1, the powers of 3 are 1 
and 3, the powers of 5 are | and 5, and the powers of 7 are | and 7. Therefore, no 
element generates U (8). 


Now, let us discuss the order of a group element. 


Definition 3.6. Let G be a group and a € G. The order of a, denoted |a|, is the 
smallest positive integer n such that a” = e, assuming that such an n exists, in which 
case a has finite order. If no such n exists, then a has infinite order. 


Example 3.15. The identity of a group is the only element having order 1. 


Example 3.16. In S3, the elemento = ¢ : ) has order 3; indeed, 0? = ie : 


whereas o°? is the identity. 


Example 3.17. In Z, every element other than 0 has infinite order. For instance, no 
matter how many times we add 8 to itself, we will never get 0. 


Example 3.18. In Ze, we have |O| = 1, |1| = |5| = 6, |2| = |4| = 3 and |3| = 2. For 
instance, 1-4=4AO0and2-4=2 40, but3-4=0, so |4| = 3. 


The order of an element tells us a great deal about its powers. 


Theorem 3.8. Let G be a group anda € G. Suppose i, j € Z. Then 


1. if a has infinite order, then a' = a! if and only if i = j; and 
2. if|a| =n < ©, thena' =a! if and only ifi = j (mod n). 


Proof. (1) Suppose that a‘ = a/, but i 4 j. Without loss of generality, say i > j. 
Then a! (a/)~' = ai (a/)~! = e. That is, a'-/ = e. But i — j is a positive integer, 
and this contradicts the assumption that a has infinite order. 

(2) Suppose that a! = a/. Once again, a‘~/ = e. Using the division algorithm, 
write i — j =nq +r, withg,r € Zand0 <r <n. Then 
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e=aiJ=a™ = (a")4a’. 


But a” = e. Thus, a” = e. But n is the smallest positive integer having this property, 
andr <n. Therefore, r = 0. That is, n|(i — j), as required. 

Conversely, suppose that i = j (mod n). Then let us write i — 7 = nk, for some 
k € Z. But we now have 


aii = qk — (a")* - ek =e 


Thus, a'~/a/ = ea/, and hence ai = a’. 


Example 3.19. In U(10), we see that |3| = 4. Thus, 3’ = 3/ if and only if 4|(i — j). 
That is, 3° = 3!*, but 3? 4 3!!. 


Example 3.20. In Z, all the integer multiples of 5 are distinct, because 5 has infinite 
order. 


Corollary 3.2. Let G be a group, and let a € G have order n < o. Then, for any 
integer i, 


1. ai =e if and only if nji; and 
2. |a'| =n/(i,n). 


Proof. (1) By the preceding theorem, a' = e = a° if and only if i = 0 (mod n). 
(2) Suppose that, for some positive integer j, we have (a‘)/ = e. We see from (1) that 
since a‘) = e, we have n|ij. Write ij = nk, with k € Z. Letting d = (n, i), we have 
j(@/d) = k(n/d). Now, (n/d,i/d) = 1. Thus, by Corollary 2.2, since n/d|j(i/d), 
we must have n/d|j. Therefore, |a‘| > n/d. But (a')"/4 = ai"/4 = a"“/®, Asi/d is 
an integer, this is (a”)'/4 = e'/4 = e. Thus, |a'| = n/d, as required. 


Example 3.21. Again considering 3 in U(10), we note that 3! = 1 if and only if i is 
a multiple of 4. Also, |3'*| = 4/(4, 14) = 4/2 = 2. 


If G is a group, and a, b € G, then we say that b is a conjugate of a if there exists 
ac € G such that b = c~!ac. 


Theorem 3.9. Jn any group, conjugate elements have the same order. 


1 


Proof. Suppose that b = c~‘ac and that a” = e, for some positive integer n. Then 


b" = (clac)" =c!acc"!acc"!--+cce7!ae 
=c laeae---ac=c'a"c=c lec=e. 
That is, |b] < |a|. But since b = c~'ac, we have a = (c~')~'be!. Thus, by the 
same argument, |a| < |b|. Therefore, |a| = |b]. 
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Exercises 


3.23. Find the order of each group, and the order of every element of each group. 
1. Zi 

2: Zo x Z4 

3.24, Find the order of every element of each group. Is the group cyclic? If so, list 
all generators. 

1. U(14) 

2. 83 

3.25. Let G = (a) be acyclic group of order 20. Find the orders of a?, a!* and a!>. 
3.26. Prove Theorem 3.6. 


3.27. Leta € Gand b € H. Suppose that |a| = 12 and |b| = 18. Find the order of 
(a,b)inG x H. 


3.28. Leta and b be elements of odd order in a group. Show that a” and b? commute 
if and only if a and b commute. Also show that this does not have to hold if a and b 
have even order. 


3.29. Let a and b be elements of a group. Show that the following pairs of elements 
have the same order: 


1. aanda™!; and 
2. ab and ba. 


3.30. Let G = {a,,..., a} bea finite abelian group. Show that a,qa - - - a, has order 
lor 2. 


3.31. Show that it is possible for an abelian group to have exactly three elements of 
order 2, but not exactly four elements of order 2. 


3.32. Suppose that G is a group in which every element has order | or 2. Show that 
G must be abelian. 


3.5 Subgroups 


One of the most important ways of obtaining new groups is to consider subgroups 
of a particular group. 


Definition 3.7. Let G be a group with operation *. Then a subset H of G is called 
a subgroup of G if H is a group under the same operation «. In this case, we write 
H <G. 
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Example 3.22. Every group is a subgroup of itself, and {e} is a subgroup of every 
group. 


When we refer to a proper subgroup of G, we mean any subgroup other than G 
itself. 


Example 3.23. We can see that Z is a subgroup of Q, and both are subgroups of R. 


We do not have to check the entire definition of a group to see if a subset is a 
subgroup. For instance, we know that the group operation is associative on the entire 
group, so it is surely associative on every subset. The following theorem will save 
us some time. 


Theorem 3.10. Let G be a group and H a subset of G. Then H is a subgroup of G 
if and only if 


1. e € H (the subset contains the identity); 
2. ab © H foralla,b € H (the subset is closed); and 
3.a'e A for alla € H (the subset contains all inverses). 


Proof. Let H be a subgroup of G. Then H has an identity, f. Thus, ff = f. But 
also, ef = f. By cancellation, f = e, giving (1). Then, by definition of a group, (2) 
and (3) must hold. 

Conversely, suppose that (1)-(3) hold. We must check that H is a group. But by 
(2), H is closed. As the group operation is associative on G, it is associative on H. 
By (1) and (3), we have an identity and inverses as well. Therefore, H is a subgroup 
of G. 


A remark is in order here. To wit, we could replace condition (1) in the above 
theorem with the weaker condition 


(1’) A is not the empty set. 


Indeed, if a € H, then we see from (3) that a~! € H, and then (2) tells us that 
e = aa! € H. So why not express it that way? Because sometimes, the subset we 
are checking is not a subgroup. And we can tell immediately that that is the case if 
the subset does not contain e. 


Example 3.24. The set of all even integers, 2Z, is a subgroup of Z. Indeed, we 
certainly have 0 = 2-0 € 2Z. If 2m, 2n € 2Z, then 2m + 2n = 2(m +n) € 2Z, so 
we have closure. Finally, if 2m € 2Z, then its inverse is —(2m) = 2(—m) € 2Z, and 
we have inverses. Of course, there is nothing magical about the number 2 here. If a 
is an integer, than aZ is a subgroup of Z. 


In fact, this last example is a specific case of a more general phenomenon. We 
have already encountered cyclic groups. 
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Definition 3.8. Let G be a group and a € G. Then the cyclic subgroup generated 
by a is the set of all powers of a in G, and we write 


(a) = {a" :neéZ}. 


Of course, the group G is cyclic if and only if there exists an a € G such that 
G = (a). 


Theorem 3.11. Jf G is a group anda € G, then (a) is a subgroup of G. 


Proof. Certainly e = a° € (a). Take any a”, a” € (a). Then aa" = at" € (a). 
Finally, if a” € (a), then (a")~! = a~™ € (a). Now apply Theorem 3.10. 


Example 3.25. In U(10), the powers of 3 are 1, 3, 9 and 7, so (3) = {1,3, 7,9} = 
U (10). Similarly, (7) = U(10). But the only powers of 9 are | and 9, so (9) = {1, 9}. 
Also, (1) = {1}. 


Example 3.26. In Zp, the multiples of 8 are 1 - 8 = 8,2-8 = 4and3- 8 = 0. Thus, 
we have (8) = {0, 4, 8}. 


Of course, we do not insist upon commutativity in groups, but it can be useful to 
know which elements commute with everything. 


Definition 3.9. If G is a group, then the centre of G, denoted Z(G), is the set of 
elements of G that commute with everything in G. That is, Z(G) = {z € G:az= 
za for all a € G}. 


Example 3.27. If G is abelian, then Z(G) = G. 


Example 3.28. The centre of 53 is the trivial subgroup, {e}. Verifying this is a matter 
of considering each element of $3 other than the identity, and finding another element 
that does not commute with it. 


Example 3.29. The centre of GL2(R) is the set of all matrices ( a where 0 4 


a € R. We leave the proof as Exercise 3.36. 
Theorem 3.12. If G is a group, then Z(G) is a subgroup of G. 


Proof. Certainly ea = a = ae for alla € G, soe € Z(G). If y,z € Z(G) anda e 
G, then yza = yaz = ayz; thus, yz € Z(G). Also, if z € Z(G) and ae G, then 
a~'z=za™!. Inverting both sides, we get z~!'a = az~!. Thus, z~! € Z(G). The 
proof is complete. 


Some shortcuts are possible when it comes to testing whether a subset is a sub- 
group. 
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Theorem 3.13. Let G be a group and H a subset of G. Then H is a subgroup if and 
only if 

I. e€ H; and 

2. ab~! € H whenever a,b € H. 


Proof. Suppose that H is a subgroup. By Theorem 3.10, we know that e € H and if 
a,b € H, then b~! € H, and therefore ab~! € H. 

Conversely, suppose that H satisfies (1) and (2). Take any a, b € H. Then since 
e € H, we have ea“! = a™! € H and, similarly, b~' € H. Therefore, a(b-!)~'! = 
ab € H. In view of Theorem 3.10, H is a subgroup. 


Once again, instead of checking that e € H, it is enough to verify that H is not 
empty. We can even make things simpler if H is a finite set. 


Theorem 3.14. Let G be a group and H a finite subset of G. Then H < G if and 
only if 


l. e€ H; and 
2. ab € H whenevera,b «€ H. 


Proof. If H is a subgroup of G, then Theorem 3.10 tells us that (1) and (2) hold. 
Conversely, suppose that (1) and (2) are true. By Theorem 3.10, we only need to 
show that if a € H thena™! € H. In view of (2), we have aa = a? € H, and hence 
a’a = a> € H, and so on; thus, a” € H for all positive integers n. But there are 
infinitely many such powers, and H is finite. Thus, there exist positive integers m 


and n, with m > n, such that a” = a”. Thena””"" = e.Ifm—n=1,thena =e, in 


which casea~! = e € H.So, suppose thatm —n > 1.Thenaa”™-""! = a"-""'q = 
a" = e. That is,a”~""! = a~!. But m — n — 1 isa positive integer, and therefore 
a”""—! € H, as required. 


We must be careful only to use the above theorem when H is finite. To see why, 
let G be the additive group of integers, and let H be the set of nonnegative integers. 
Then # contains 0 and is closed under addition, but H is not a subgroup of G, since 
1 has no additive inverse. 


Example 3.30. LetG = Zg x Zs, andlet H = {(a, b) € G: 4a = 0}. Weclaim that 
H is asubgroup of G. Clearly, (0, 0) € H. Also, if (a, b), (c,d) € H, then (a, b) + 
(c,d) = (a+c,b+d), where 4(a +c) = 4a+4c =0+0=0. Therefore, H is 
closed, and hence a subgroup. 


We conclude the section with an extended, and important, example. Suppose we 
have a floor consisting of featureless square ceramic tiles. Let us pry up one of the 
tiles, and then consider all of the ways in which we can move the tile around in 
three-dimensional space, and then replace it so that it looks exactly as it did when 
we began. For convenience, let us label the vertices of the square 1, 2, 3 and 4. 
Then we can see that each vertex moves to the position of some vertex. Also, two 
vertices will not move to the same place. Once we have positioned the vertices, we 
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Fig. 3.1 The symmetries in the dihedral group Dg 


are done. Therefore, these symmetries of a square can be regarded as permutations 
of the set {1, 2, 3, 4}; that is, as elements of S4. Of course, the identity of S4 is such 
a symmetry, and if we compose two of these symmetries, then we get another. Thus, 
by Theorem 3.14, they form a subgroup of S4, known as the dihedral group of order 
8, and denoted Dg. 

What are the elements of Dg? They are illustrated in Figure 3.1. There are four 
rotations, Ro, Roo, Rigo and R279, where Ry is a counterclockwise rotation by a 
degrees. We also have four flips, F; through F4, about the lines shown in the diagram. 

And that is all! Indeed, vertex | can go to any of the 4 vertices, but then vertex 2 
must be adjacent to it, not diagonally opposite. Once vertices | and 2 are positioned, 
the others fall into place. Therefore, |Dg| = 8. The group table of Dg is shown in 
Table 3.4. 

Remember that when we write Roo F; = F3, we mean perform F; first, then Roo. 
We note that Rog F; 4 F; Roo, and therefore Dg is a nonabelian group of order 8. In 
fact, a quick glance through the table tells us that the centre of Dg is {Ro, Rigo}. 


3.5 Subgroups 53 


Table 3.4 Group table for the dihedral group Dg 


Ro Roo Riso R20 MO OF OM 
Ro | Ro Roo Riso Ro Fi RO 
Roo | Roo Riso R200 Ro Fe Fe mF 
Rigo}Riso Ro70 Ro Ro hh Mh hm 
Ro70|R270 Ro Roo Rigo Ky Fs OR OD 
AiR & KH FPF Ro Riso R270 Roo 
Fy |}ky F FL Fy Rigo Ro Roo R270 
Py) Fy Fi Fa Fy Roo R270 Ro Riso 
Fy | Fs Fy FR Fi Ro70 Roo Rigo Ro 


We do not have to begin with a square. Indeed, let us consider any regular n-gon, 
with n > 3. Then the symmetries of this n-gon form a subgroup of S,,. By precisely 
the same arguments as above, it will consist of n rotations and n flips. (There are n 
possible locations for a given vertex, and once it is fixed, 2 choices for an adjacent 
vertex. After fixing those vertices, there are no choices remaining.) We call this group 
of symmetries the dihedral group of order 2, and denote it by D2,. In particular, if 
n = 3, we note that Dg consists of all of S3, but for larger n, Dz, is a proper subgroup 
of S,. In any case, we now have an example of a nonabelian group of every even 
order except 2 and 4. 


Exercises 
3.33. In each case, is H a subgroup of G? 


1. G = GL2(R), A is the set of matrices with determinant | 
2. G = Dy, FH is the set of flips 
3. G=Q,H= {a/b: a,b €Z,2¢ b} 


3.34. In each case, is H a subgroup of G? 


1. G = Do, A is the set of rotations 

2. G=Q,) A is the set of nonnegative rational numbers 

3. Gis the multiplicative group of nonzero rational numbers, H is the set of positive 
rational numbers 


3.35. For each positive integer n > 3, determine the centre of Do,. 


3.36. Show that the centre of GL2(R) consists of the matrices ( a for allO # 


aeéeR. 


3.37. Show that the intersection of two subgroups of G is also a subgroup. Then 
extend this to show that if N; is a subgroup of G for every i in some set T, then 
(lier Ni is a subgroup of G. 


3.38. Let H and K be subgroups of G. Show that H U K is a subgroup of G if and 
only ifeither H C K or K CH. 


3.39. Find every cyclic subgroup of each of the following groups. 
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1. Zoo 
2. U(16) 


3.40. Let G be an abelian group and n € N. Let H = {a € G: a" =e} and K = 
{a” : a € G}. Show that H and K are subgroups of G. 


3.41. In any dihedral group, show that a rotation followed by a rotation, or a flip 
followed by a flip, is a rotation, whereas a rotation followed by a flip or a flip followed 
by a rotation is a flip. 


3.42. Let G be the set of all sequences of integers (a), a2, a3, ...). 


1. Show that G is a group under (qj, a2,...) + (bi, b2,...) = (ai +)1,a.+ 
bo, ...). 

2. Let H be the set of all elements (a1, az, ...) of G such that only finitely many a; 
are different from 0 (and (0, 0, 0, ...) € H). Show that H is a subgroup of G. 


3.6 Cyclic Groups 


Cyclic groups have a very straightforward structure. Let us prove a few basic facts. 
First, we can illustrate the link between the order of an element and the order of a 


group. 


Theorem 3.15. Let G = (a) be cyclic. If a has infinite order, then all powers of a 
are distinct. If \a| =n < 00, then the distinct elements of G are e, a, a®*,..., a", 
In particular, |a| = |(a)|. 


Proof. If a has infinite order, then we can use Theorem 3.8. Suppose |a| = n < oo. 
If m € Z, then write m = ng +r, where g,r € Zand0O <r <n. Thenasm=r 
(mod n), Theorem 3.8 tells us that a” = a’. In particular, every element of G is equal 
to some a! (0 <i <n). Now, suppose that a! = a/, with 0 <i < j <n. Then by 
Theorem 3.8, i = j (mod 7). But given the range of values for i and j, this is 
impossible. 


The subgroups of cyclic groups are also easy to determine. 
Theorem 3.16. Every subgroup of a cyclic group is cyclic. 


Proof. Let G = (a), and let H < G.If H = {e}, then H = (e), and we are done, so 
assume that H is not the trivial subgroup. Then H contains a’, for some nonzero 
integer m. If m < 0, then H also contains (a’")~'! = a~", so H contains a positive 
power of a. Let n be the smallest positive integer such that a” € H. We claim that 
H = (a"). Surely H contains every power of a”, so (a") < H. But suppose a‘ € 
H. Then write k = ng +r, with g,r € Zand0 <r <n. Now, H contains a‘ and 
(a")~4, and therefore a‘ (a")~4 = a*‘—"4 = a". But n is the smallest positive integer 
such that a” € H. Asr <n, wecan only have r = 0. Thus, a‘ = (a”)4 € (a”). That 
is, H < (a”), proving the claim. We are done. 
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Actually, we can say more. 


Corollary 3.3. Let G = (a), where |a| =n < oo. Then the order of every subgroup 
of G is a divisor of n. Furthermore, if m is a positive divisor of n, then G has exactly 
one subgroup of order m, namely (a"/"), 


Proof. By the preceding theorem, every subgroup is of the form (a“), forsomek € Z. 
But Corollary 3.2 tells us that the order of every power of a is a divisor of n. 

Let m be a positive divisor of n. Again using Corollary 3.2, we see that |a”/”"| = 
n/(n,n/m) =n/(n/m) = m. Thus, (a”/"") is indeed a subgroup of order m. Let us 
check that it is unique. Suppose that (a*) is a subgroup of order m. Then |a*| = m, and 
so (a‘)” = e. That is, a*” = e, hence n|km. But then n/m|k. Writing k = (n/m)i, 
with i € Z, we have a‘ = (a"/™)! € (a"/"). Thus, (a*) < (a"/"). But these two 
subgroups have the same order. Therefore, they are equal. 


Example 3.31. Let G = (a), where |a| = 20. Then G has exactly one subgroup of 
8 gl? qo), 


order 5, namely (a*) = {e,a*, a’, al, a 
Thus, acyclic group can only have one subgroup of any given order. This is a special 
property of cyclic groups; indeed, Dg and U(8) are easily seen to have several 
different cyclic subgroups of order 2. 

We can also discuss the number of elements of a particular order in a cyclic group. 
Some notation will be helpful. The following function is named after Leonhard Euler. 


Definition 3.10. The Euler phi-function is a function g : N > N, where y(n) is 
the number of positive integers less than or equal to n that are relatively prime to n. 


Example 3.32. Of the integers from | to 10, only 1, 3, 7 and 9 are relatively prime 
to 10, so g(10) = 4. The first few values of g are given in Table 3.5. 


From the definition of the group U(n), we immediately obtain the following. 
Theorem 3.17. For any positive integer n, |U(n)| = y(n). 


But we can also use the Euler function to count the elements of a particular order 
in a finite cyclic group. 


Theorem 3.18. Let G = (a) be acyclic group of order n. Let m be a positive divisor 
of n. Then the number of elements of order m in G is p(m). 


Proof. If bis an element of order m in G, then (b) must be the unique cyclic subgroup 
of order m. That is, all of the elements of order m in G are in the cyclic subgroup 
of order m. Thus, we may as well assume that G is cyclic of order m. We must 


Table 3.5 Values of the Euler phi-function 


n [1234567891011 12 13 1415 16 
p(n)J1 12242646 4 104126 8 8 
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therefore decide which elements of this group have order m. But by Corollary 3.2, 
the order of a‘ is m if and only if (k, m) = 1. By definition, the number of such k, with 
1<k <™m, is g(m). (By Theorem 3.15, the elements of (a) are precisely a’, with 
0 <i <™m, butasa® = a” =e, this is the same as considering a with 1 <i <m.) 


Example 3.33. Let G = (a) be cyclic of order 50. Then we know that there are 
g(10) = 4 elements of order 10 in G. They lie in the subgroup of order 10, namely 
(a>°/!) — (a>). Indeed, the precise elements will be (a°)*, where (k, 10) = 1. This 
means that k € {1, 3, 7, 9}, so the elements of order 10 are a>, a!°, a®* and a*. It is 
worth noting that the number of elements of order 10 in a cyclic group of order one 
million is also g(10) = 4. 


For relatively small numbers, g(7) is easy to determine, but for large n, it would be 
tedious to go through all the numbers from | to n in order to see if they are relatively 
prime to n. Happily, there is a shortcut. The first part of the following theorem is 
Exercise 3.45. It will make more sense if we postpone the proof of the second part 
until Section 4.4. 


Theorem 3.19. Let p be a prime number, and let m and n be positive integers. Then 
1. g(p") = p" — p"™!; and 
2. if (m,n) = 1, then g(mn) = g(m)eg(n). 

Thus, we can determine g(n) by writing n as a product of powers of primes and 
then using the above theorem. 


Example 3.34. We have v(81) = 81 — 27 = 54 and (540) = 9(4)9(27)9(5) = 
(4 — 2)(27 — 9)(5 — 1) = 144. 


Exercises 


3.43. 1. Let G = (a) be acyclic group of order 12. List every subgroup of G. 
2. List every subgroup of Zj2. 


3.44. 1. Let G = (a) be acyclic group of order 120. List all of the elements of 
order 12 in G. 
2. How many elements of order 12 are there in a cyclic group of order 1200? 


3.45. Let p be a prime and na positive integer. Show that g(p") = p” — p"!. 


3.46. Find all positive integers n such that |U (n)| = 24. 


3.47. Let G be a nonabelian group. If H and K are cyclic subgroups of G, does 
it follow that HM K is also a cyclic subgroup? Prove that it does, or provide a 
counterexample. 


3.48. Let G = (a) be infinite cyclic. If m and n are positive integers, find a generator 
for (a) MN (a"). 
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3.49. Let n be a positive integer and let T be the set of positive integers that divide 
n. Show that )°,<7 v(k) =n. 


3.50. For precisely which positive integers n is U(2") cyclic? 
3.51. Let G be any group and n a positive integer. 


1. If A and K are subgroups of order n in G, and H # K, show that HM K does 
not contain any elements of order n. 

2. Show that the number of elements of order n in G is either a multiple of g(7) or 
infinite. 


3.52. Show that a nontrivial group G has no nontrivial proper subgroups if and only 
if G is cyclic of prime order. (Do not assume, to begin with, that G is finite.) 


3.7 Cosets and Lagrange’s Theorem 


One important fact we learned in the preceding section is that if G is a finite cyclic 
group, then the order of every subgroup of G divides the order of G. As it turns out, 
this is true for all finite groups, but a different proof will be required. To this end, we 
need some new terminology. 


Definition 3.11. Let G be a group and H a subgroup. If a, b € G, we say that a is 
congruent to b modulo H, and we write a = b (mod H), if a~'b € H (or, in the 
case of an additive group, if —a +b e€ H). 


Example 3.35. Let G = Z and H = 5Z. Then as —1 + 16 = 15 € H, we see that 
1 = 16 (mod #). In this particular case, the notion is identical to congruence 
modulo 5. 


Example 3.36. Let G = U(20) = {1,3, 7,9, 11, 13,17, 19}, and let H = (3); 
namely, H = {1,3, 7,9}. Then we note that 13-!.19=17-19=3 € H. Thus, 
13 = 19 (mod A). 


Lemma 3.1. Let G be a group and H a subgroup. Then congruence modulo H is 
an equivalence relation on G. 


Proof. Reflexivity: If a € G, then a~'a = e € H, and therefore a =a (mod H). 
Symmetry: If a,beG and a=b (mod A), then a~'b € H, and therefore 
(a~'b)~! = b~'a lies in H as well. But this means that b =a (mod #). Transi- 
tivity: Suppose that a, b,c € G, where a = b (mod A) andb=c (mod H). Then 
a~'b, b~'c € H. Butin this case, H contains their product, a~'!bb~'c = a~'c. Thus, 
a=c (mod #). We are done. 


What are the equivalence classes? 


Lemma 3.2. Let G be a group and H a subgroup. If a € G, then its equivalence 
class with respect to congruence modulo H is the set {ah :h € H}. 
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Proof. lf a = b (mod H), then a~'b € H, so a~'!b = h, for some h € H. Thus, 
b = ah, which is in our set. Conversely, if b = ah, for some h € H, then a 'b= 
h € H, and therefore a = b (mod #). 


We need a name for this set. 


Definition 3.12. Let G be a group, H < G anda e€G. Then the left coset of a 
with respect to H is the set {ah : h € H}, which is denoted aH. (Note: If the group 
operation is addition, then we will writea + H = {a+h:h e€ H}.) 


Example 3.37. If G = U(20), let H = (9) = {1,9}. Then 3H = {3-1,3-9} = 
{3, 7}. Also, 7H = {7-1, 7-9} = {3, 7}, so 3H = 7H. Furthermore, 1H = 9H = 
H,11H = 19H = {11, 19} and 13H = 17H = {13, 17}. Note that these left cosets 
partition G. 


Example 3.38. Let G = Z and H = 3Z. Then there are three left cosets: 0+ H = 
HA, 1+H ={---,—5,—-2,1,4,7,...} and 2+ H ={--- ,—4,—-1,2,5,8,...}. 
Note that2 + H =5+ H = —13+ H,andso on. Again, the left cosets partition G. 


In general, we know that equivalence classes always partition a set. Therefore, 
we can record the following result. 


Theorem 3.20. Let G be a group and H a subgroup. Then the left cosets of H in G 
partition G. In particular, 


I. eacha € G is in exactly one left coset, namely aH; and 
2. ifa,b €G, then eitheraH = bH oraHN bH=@. 


Two points should be kept in mind here. First, left cosets are not subgroups! 
Remember, the left cosets partition G, and therefore the identity can only be in one 
of them, namely, eH = H. The rest cannot possibly be subgroups. Second, as we 
have already seen, when we write aH, the element a is not unique. Indeed, since the 
left cosets are equivalence classes, we have aH = DH if and only ifa~'b € H. 

We can now prove our first big result on finite groups, due to Joseph-Louis 
Lagrange. 


Theorem 3.21. (Lagrange’s Theorem). Let G be a finite group and H a subgroup. 
Then || divides |G|. 


Proof. We have already seen that G is partitioned into left cosets; in particular, |G| is 
the sum of the sizes of these left cosets. But foranya € G,aH = {ah:h € H}.Now, 
if ah, = aho, withh,, hz € H, then by the cancellation law, h; = hz. Therefore, aH 
consists of precisely | H| distinct elements. It now follows that the order of G is |H 
multiplied by the number of left cosets. In particular, | H| divides |G]. 


Definition 3.13. Let G be a group and H < G. Then the index of H in G, denoted 
[G : H], is the number of left cosets of H in G. 


Corollary 3.4. If G is a finite group and H is a subgroup, then|G : H| = |G|/|H|. 
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Proof. This is immediate from the proof of the above theorem. 


Example 3.39. Let G= Dg and H= (Roo) = {Ro, Roo, Rigo, R70}. Then 
[G : H] = |G|/|H| = 8/4 = 2. Thus, there are two left cosets. One is RoH = H. 
The other must be F, H = {F), Fo, F3, Fy}. If K = (F,) = {Ro, Fi}, then it must 
have 8/2 = 4 left cosets. One is RoK = K. To find another, just choose an element 
of G that we have not yet found, say Roo. Then we get Ron K = { Roo, F3}. We haven’t 
yet used Fy, so take Fy K = {F), Rigo}. Finally, we can take Royo K = {Ro70, Fy}. 


Example 3.40. Note that the subgroups of an infinite group can be of finite or infinite 
index. For instance, we saw above that 0+ 3Z, 1+ 3Z and 2+ 3Z are the distinct 
left cosets of 3Z in Z. Thus, [Z : 3Z] = 3. On the other hand, Z has infinite index 
in Q. To see this, observe that for all positive integers n, the left cosets 1/n + Z are 
distinct. And there are still more! 


Lagrange’s theorem has a beautiful consequence. 


Corollary 3.5. Let G be a finite group, and a € G. Then the order of a divides the 
order of G. 


Proof. The order of a is the order of the cyclic subgroup generated by a, and that 
must divide the order of G. 


Example 3.41. Note that |Dg| = 8, the identity has order 1, Rigo and the flips all 
have order 2 and | Roo| = |R270| = 4. All of the orders are divisors of 8. 


Of course, it does not follow that because a number n divides the order of a group, 
then the group has an element of that order. Indeed, if that were always true, then a 
group of order n would have to have an element of order n, and therefore every finite 
group would be cyclic, which is not the case. 

One important thing that we can do is to try to classify all the groups of some 
particular order. We can now make a step in that direction. 


Corollary 3.6. Every group of prime order is cyclic. 


Proof. Take e 4a € G, where |G| is a prime. As |a| divides |G|, and |a| ~ 1, we 
must have |a| = |G|. But then |G| = |(a)|, and therefore G = (a). 


Not surprisingly, there is also such a thing as a right coset. Indeed, if we had 
defined a = b (mod H) to mean that ab~! € H, then we would have found that 
this is still an equivalence relation, and the equivalence classes would have been as 
follows. 


Definition 3.14. Let G be a group and H < G. Then for any a € G, the right coset 
of a with respect to H is Ha = {ha:h € H}. (If G is an additive group, then we 
write H+a={h+a:he H}.) 


If G is abelian, then there is no distinction between left and right cosets. In 
nonabelian groups, right cosets also partition G, but possibly in a different way. 
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Example 3.42. Take G, H and K as in Example3.39. Then we can see that 
one right coset of H in G is HRo = H = RoH and the other must be HF; = 
{F,, Fo, F3, Fs} = FH. Here, the left and right cosets agree. But it is not the same 
for K. For instance, Rog K = {Roo, F3}, but K Roo = { Roo, F4}. 


Would it have made a difference if we had defined the index of H in G using 
right cosets instead of left? Fortunately, no. This is clear if G is finite, as Lagrange’s 
theorem works equally well using right cosets. But what if G is an infinite group 
having a subgroup H of index n < 00? Then notice that aH = bH if and only if 
a'b € H, but also Ha~! = Hb™' if and only if a~'b € H. Thus, if the distinct 
left cosets of H in G are a, H,a2H,...,a,H, then the distinct right cosets are 
Ha; Hay", 2 ees Ha;'. 


Exercises 

3.53. For each group G and subgroup H, find all the left cosets and right cosets of 
HinG. 

1. G=Z,H =4Z 

2. G= Dg, H = (Fh) 


3.54, For each group G and subgroup H, find all the left cosets and right cosets of 
HinG. 


1. G=U(13), H = (8) 


123 
2 Gas. =((123) 


3.55. Let G be a group whose order is the product of two (not necessarily distinct) 
primes. Show that every proper subgroup of G is cyclic. 


3.56. Let G be a group of order p”, for some prime p and positive integer n. Show 
that G has an element of order p. 


3.57. Let G be a group having a subgroup H of order 28 and a subgroup K of order 
65. Show that HM K = {e}. 


3.58. Let G be a finite group having an element of order k, for each 1 < k < 10. 
What is the smallest possible order of G? Show that a group of that order exists 
having this property. 

3.59. Let G = {a),..., az} be an abelian group of odd order k. Show that 
a1d2°::ay =e. 

3.60. Show that every group of order 55 contains an element of order 5 and an 
element of order 11. 


3.61. Let G be a group with subgroups H and K. If [G : K] =n, show that [H : 
HK] <n. 


3.62. Let G be a group with subgroups H and K such that K < H. Suppose that 
[G : H] =mand[H: K] =n. Show that [G : K] = mn. (Do not assume that G is 
finite.) 


Chapter 4 ®) 
Factor Groups and Homomorphisms si 


In the previous chapter, we tended to consider just one group at a time. But we need to 
find ways of relating groups to each other. For instance, we would like to know if two 
groups are, in every meaningful sense, the same. This would be the case if we took 
a group and created a new one by simply changing the labels on the group elements, 
but left the structure otherwise intact. Surely, we would not wish to think of these as 
different sorts of groups.' This is where the notion of a group homomorphism and, 
in particular, an isomorphism, will come into the picture. 

But first, we will discuss factor groups. These constitute an important way of 
creating new groups from old ones. As we shall see, there is a natural connection 
between factor groups and homomorphisms. In order to define a factor group, we 
require a special sort of subgroup, called a normal subgroup. Let us begin there. 


4.1 Normal Subgroups 


Let H bea subgroup of G. We would like to form a group whose elements are the left 
cosets aH. Unfortunately, as we shall see in the next section, not just any subgroup 
will suffice; we need an extra condition. This is where normal subgroups come in. 

Recall that if H < G, then the left cosets of H do not necessarily coincide with 
the right cosets. We need to consider subgroups for which they do coincide. 


Definition 4.1. Let G be a group and N a subgroup. We say that NV is a normal 
subgroup of G if aN = Na forallae G. 


Example 4.1. For every group G, G is anormal subgroup of itself, asaG = Ga = G 
for all a. Also, {e} is normal. Indeed, a{e} = {e}a = {a} for all a. 


‘Upon reading this sentence aloud, the author failed to stop himself from writing “And don’t call 
me Shirley.” We miss you, Leslie Nielsen! 
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Example 4.2. The centre of every group is a normal subgroup. Indeed, writing Z = 
Z(G), we have aZ = {az : z € Z} = {za: z € Z} = Za. In fact, every subgroup of 
Z(G) is normal in G, for precisely the same reason. In particular, every subgroup of 
an abelian group is normal. 


Be warned: this last example can be a bit misleading. Remember, when we say 
that aN = Na, we do not necessarily mean that an = na for all n € N. Indeed, we 
could have an = n,a, for some different n} € N. The following example may be 
helpful. 


Example 4.3. Refer to Example 3.42. We saw that in Dg, the subgroup (Roo) is nor- 
mal. That is, a(Ro9) = (Roo)a for all a € Dg. This does not mean that aRo9 = Rooa, 
however. Indeed, F|Ro9 = R270F 1. But as R279 € (Roo), this is fine. We also saw in 
that example that (F;) is not a normal subgroup of Dg, as Roo(F1) 4 (F) Roo. 


There is one special case in which we do not need to worry about normality. 
Theorem 4.1. /f G is a group, then any subgroup of index 2 is normal in G. 


Proof. Let H be a subgroup of index 2. Then one of the left cosets is H, and the 
other must consist of everything outside of H. In particular, aH = H if a € H and 
aH is the other left coset if a ¢ H. But exactly the same thing can be said for right 
cosets! So the left and right cosets agree. 


Example 4.4. It is worth noting that if N is a normal subgroup of G and H is a 
normal subgroup of N, it does not necessarily follow that H is normal in G. For 
instance, N = {Ro, Rigo, Fi, F2} is a normal subgroup of Dg. (To check that it is 
a subgroup, use Theorem 3.14. To check that it is normal, use Theorem 4.1.) Also, 
H = (F}) is normal in N. (Again, it has index 2.) But as we saw in Example 4.3, H 
is not normal in Dg. 


Let us define a new subgroup. 


Definition 4.2. Let H be a subgroup of G. Then for any a € G, we write a~'Ha = 
{a 'ha:heé H}. 


Theorem 4.2. [fH is a subgroup of G anda € G, then a~'Ha is a subgroup of G. 
Furthermore, \|a~'|Ha| = |H |. 


Proof. We have e € H, and therefore e = a~'ea € a~'Ha. If a~'hya,a“!ma € 
a~'Ha, then 


(a thya)(a tha) = a thy (aa! )na = at hyehna = a hina € a'Ha, 


since hh) € H. Finally, if a~'!ha € a~'Ha, then (a~'ha)~'! = a~'h-'a € a7'Ha, 
since h-! € H. Thus, a~'Ha is a subgroup of G. Also, given the definition of 
a~'Ha, it is clear that we can only get one element for each element of H. But if 
a~'ha = a~'hga, then by cancellation, h, = hy. Thus, |a~'Ha| = |H|. 
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We can use this to give several different ways of saying that a subgroup is normal. 


Theorem 4.3. Let G be a group and H a subgroup. Then the following are equiva- 
lent: 


1. H is normal in G; 

2. aha €H forallh € H andallae G; 
3. a-'Ha CH forall a € G; and 

4. a 'Ha=H forallae G. 


Proof. Itis clear that (4) implies (3) and (3) implies (2). Let us show that (2) implies 
(1). Suppose that (2) holds. Take any a € G. Then for any h € H, we have a~'ha = 
hy, for some h, € H. Thus, ha = ah; € aH.Thatis, Ha C aH. Also, (a~!)~!ha7! = 
hy, for some hy € H. That is, aha~! = hy, and therefore ah = hoa € Ha. Thus, 
aH C Ha, so aH = Ha and (1) is proved. 

Finally, let us show that (1) implies (4). Let H be a normal subgroup of G. Take 
any a € G. Then Ha = aH. Thus, for any h € H, we have ha € aH, and therefore 
ha = ah, for some h, € H. That is, a~'ha = h, € H. Therefore, a~'Ha C H. But 
using ain place of a, we also get aHa"' C H. Hence, if h € H, then aha7! = ho, 
for some hy € H. But now h = a~!hoa € a~'Ha. That is, H C a~'Ha, and we are 
done. 


Example 4.5. Let SL,(R) denote the set of all matrices in GL, (IR) having determi- 
nant 1. We call this the special linear group. In view of Exercise 3.33, we know 
that SL,(R) is a subgroup of GL,(R). In fact, it is a normal subgroup. Indeed, if 
A € SL,(R) and B € GL, (R), then 


det(B~'AB) = det(B™') det(A) det(B) = det(B™') det(B) det(A), 


since the determinants are just real numbers. But this is det(B~'B) det(A) = 1, since 
B~'B is the identity matrix and det(A) = 1. Therefore, B~'AB € SL,(R), and by 
Theorem 4.3, SL, (IR) is indeed normal. 


Another useful construction is the following. 


Definition 4.3. If H and K are subgroups of G, then we write HK = {hk : he 
H,k € K}. (Ifthe group operation is addition, write H+K = {h+k:heH,k € K}.) 


Note that HK is a subset of G, not necessarily a subgroup! It is easy to come up 
with examples where HK is not a subgroup, but the following theorem will lead us 
to some that cannot possibly work. 


Theorem 4.4. [fH and K are finite subgroups of a group G, then 


| ||K| 


|HK| = 
lH NK| 
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Proof. Considering all possible h € H andk ¢€ K, it is clear that we can produce 
at most |H||K| elements hk, but we must determine how many times each unique 
group element appears in such a list. Note that if hk, = hok2, with h; €¢ A and 
k; € K, then hs'hy = koky! € HK. Thus, hy = ng and kz = gk, for some 
g © HK. Conversely, if hy = hog and kz = gk, with g e HM K, then hk; = 
hyogk; = hog o'ky = hyk». In other words, each hk will occur once for every element 
of H 1 K. The result follows. 


Example 4.6. Let G = $3 and let H and K be any two different subgroups of order 2. 
Then H MK can only contain the identity, and therefore |HK| = 4. But by Lagrange’s 
theorem, a group of order 6 cannot have a subgroup of order 4. Therefore, HK is not 
a subgroup. 


But HK will be a subgroup if either H or K is normal. 


Theorem 4.5. Let H and K be subgroups of G. Then 


1. if either H or K is normal in G, then HK is a subgroup of G; and 
2. if both H and K are normal in G, then HK is normal as well. 


Proof. (1) Observe that e = ee € HK. Suppose that H is normal. Let us show 
closure. If h; € H and k; € K, then 


(hik1) (hok2) = hy (kyhoky')kiko. 


Since H is normal, kihoky' € H, and therefore hykyhok,' € H, kjky © K, as 
required. Also, 


(ka) | = ky thy’ = (ey hy ek. 
Again, since H is normal, ky thy tk € H, so (hik,)~! € HK. If K is normal, the 
proof is similar and left as an exercise. 


(2) Take hh € H,k € K anda e€ G. Then 


a 'hka = (a“'ha)(a~'ka). 


But a~!ha € H and a7'ka € K. Thus, a~'hka € HK. 


Exercises 
4.1. Is each of the following sets a normal subgroup of GL, (R)? 
1. H = {A € GL,(R) : det(A) € Q} 


2. the set of diagonal matrices (( ;) in GL,(R) 


4.2. Find every normal subgroup of 53. 


4.3. If N is a normal subgroup of G, and |N| = 2, show that N < Z(G). 
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4.4. Let N be anormal subgroup of G. Let H be the set of all elements h of G such 
that hn = nh for all n € N. Show that H is a normal subgroup of G. 


4.5. Show that the intersection of two normal subgroups of G is also a normal 
subgroup. Then extend this to show that if N; is a normal subgroup of G for every i 
in some set T, then ();.7 Nj is a normal subgroup of G. 


4.6. Let N; < No < N3 <.--- be normal subgroups of G. Show that ear N; isa 
normal subgroup of G. 


4.7. Let G be a group having exactly one subgroup H of order n. Show that H is 
normal in G. 


4.8. Let G = H x K.If N is a normal subgroup of H and L is a normal subgroup 
of K, show that N x Lis anormal subgroup of G. Is every normal subgroup of G of 
this form? 


4.9. Suppose that H is a subgroup of G and a~'b~!ab € H, for alla, b € G. Show 
that H is normal. 


4.10. Let H and K be subgroups of G. Show that HK is a subgroup if and only if 
HK = KH. 


4.2 Factor Groups 


We are now in a position to construct a new sort of group. 


Definition 4.4. Let G be a group and N a normal subgroup. Then the factor group 
(or quotient group) G/N is the set of all left cosets aN, with a € G, under the 
operation (aN)(bN) = abN. 


The fact that the factor group is indeed a group needs proving. Then we can look 
at some examples. 


Theorem 4.6. /f G is any group and N is anormal subgroup, then G/N is a group 
of order [G : N]. 


Proof. The main point is to verify that the operation is well-defined. The rest will 
follow easily from the fact that G is a group. In other words, suppose that ajN = a,N 
and b;N = b2N. We must show that a,;b;N = ayb2N. Otherwise, this operation is 
nonsensical. But as a;N = a2N, we have a, ay = n,, for some n; € N. Similarly, 
by 'by = ny € N. Then 


(aby) ‘ayby = by ay !arby = by my by = (by) 1b) (bp 'b2) = by ni bind. 


Now, as N is normal, by'niby € N. Thus, (a1b,)~!a.b. € N, which means that 
a,b,N = ayb2N, as required. 
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Let us check the group properties. As for closure, if aN and bWN are left cosets, 
then so is abN. Also, for any a, b, c € G, we have 


aN (bNcN) = aNbcN = a(bc)N = (ab)cN = (aNbN)cN, 


So associativity is proved. If a € G, then aNeN = aN = eNaN;; thus, eN is the 
identity of G/N. Finally, aNa~'N = eN = a~'NaN;; that is, a~'N is the inverse of 
aN . Therefore, G/N is a group. The group consists of the left cosets, so the order is 
the number of left cosets, which is [G : N]. The proof is complete. 


Notice that the proposed group operation would not even be well-defined if N 
were not a normal subgroup. 


Example 4.7. Let G = U(15) = {1, 2,4, 7, 8, 11, 13, 14}, and let VN = (14) = 
{1, 14}. There is no need to worry about normality, since G is abelian. The left 
cosets are IV = {1,14}, 2N = {2,13}, 4N = {4,11} and 7N = {7, 8}. Thus, 
G/N = {1N, 2N, 4N, 7N}. We note that (4N)(7N) = 13N = 2N and (2N)(4N) = 
8N = 7N. The rest of the group table is given in Table 4.1. We can also use this table 
to find inverses; for instance, 2N7N = 1N. Since IN is the identity, (2N yl =7N. 


Table 4.1 Group table for U(15)/(14) 
1N 2N 4N 7N 


IN|1N 2N 4N 7N 
2N|2N 4N 7N 1N 
A4N|4N 7N IN 2N 
IN\|7N 1N 2N 4N 


Example 4.8. Let G = Zand N = SZ. Again, N is certainly a normal subgroup 
of G. Also, G/N = {0+ N,1+N,2+N,3+N,4+N}. Addition in the factor 
group behaves like modular arithmetic; indeed, (1 + VN) + (2+ N) =3+4+WN and 
34+N)+(4+N)=74+N =2+N. The full group table is given in Table 4.2. 
Note that Z/5Z has precisely the same group table as Zs (see Table 3.1). 


Table 4.2 Group table for Z/5Z 


|O+N 1+N 2+N 3+N44+N 
NJO+N 1+N 2+N3+N 4+N 
tN|1+N2+N3+N4+N04+N 
N\2+N3+N4+N0+N14N 
N)J3+N4+N0+N 14+N24N 
N|4+N 0+N 14+N2+N34N 


RWNrFO 
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Example 4.9. Let G = Dg and N = (Roo). As N has index 2, it is necessarily a 
normal subgroup, by Theorem 4.1. In fact, there are only two left cosets, RoN, which 
consists of all of the rotations, and FN, which consists of all of the flips. The group 
table is given in Table 4.3. 


Table 4.3 Group table for Dg/(Ro0) 


RoN F\N 
RoN|RoN FN 
F,\N|F\N RoN 


Observe that powers of group elements in a quotient group work as we would 
expect. Indeed, (aN) = a'"N, for any integer m. In particular, (aN)~! = a~!N. Let 
us prove a few other useful facts. 


Theorem 4.7. Let G be a group, witha € G and N anormal subgroup of G. Then 


1. if G is abelian, then so is G/N; 
2. if G is cyclic, then so is G/N; and 
3. if|a| =m < @™, then |aN| divides m. 


Proof. (1) If b,c € G, then (bN)(cN) = beN = cbN = (cN)(DN). 

(2) If G = (b), then for any cN € G/N, let us say that c = b*. ThencN = b‘N = 
(bN)*. Thus, G/N = (DN). 

(3) Note that (aN)” = aN = eN. Thus, by Corollary 3.2, the order of aN divides 


mM. 


A small word of caution is in order. Do not assume that the order of a equals that 
of aN. All we know is that |aN| divides |a|. Also, if a has infinite order, then we 
know nothing about |aN|; it could be finite or infinite. 

The following theorem tells us how to determine the subgroups of a factor group. 
The proof, however, is left as Exercise 4.18. 


Theorem 4.8. Let G be a group and N anormal subgroup. Then the subgroups of 
G/N are precisely of the form H/N, where H is a subgroup of G containing N. 
Furthermore, H/N is normal in G/N if and only if H is normal in G. 


Here is one more rather neat fact about factor groups. 
Theorem 4.9. Let G be any group. If G/Z(G) is cyclic, then G is abelian. 


Proof. Let Z = Z(G), and suppose that G/Z = (aZ). Take any b,c € G. Then 
bZ = a"Z, for some integer m, and cZ = a"Z, for some integer n. Thus, b = ay 
andc = a"z, for some y, z € Z. But noting that powers of a commute with each other 
and elements of Z commute with everything, we have be = a” ya"z = a"zay = cb. 
Thus, G is abelian. 
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Corollary 4.1. The centre of a group cannot have prime index in that group. 


Proof. If G/Z(G) has prime order, then by Corollary 3.6, G/Z(G) is cyclic. But 
then the preceding theorem tells us that G is abelian; therefore, Z(G) = G has index 
1, which is not prime. 


Note that it is entirely possible for G to be nonabelian but G/Z(G) to be abelian. 
See Exercise 4.13. 


Exercises 
4.11. Let G be a group having a normal subgroup N. Suppose that in G/N, the 


order of aN is 5. If |N| = 14, what are the possible orders of a? Show that each order 
you find can actually occur in some group. 


4.12. Write the group table for 


1. Dg/{Rigo) 
2. U(40)/(3). 


4.13. Find a nonabelian group G such that 


1. G/Z(G) is abelian 
2. G is infinite, but G/Z(G) is finite. 


4.14. Show that an element of the factor group R/Z has finite order if and only if it 
is in Q/Z. 


4.15. Let G be a finite group having a normal subgroup N. If G/N has an element 
of order 42, show that G has an element of order 42. Does the same hold for infinite 
groups? 


4.16. Let N be a normal subgroup of G. Show that G/N is abelian if and only if 
a'b~'ab € N for alla, be G. 


4.17. Suppose that G has normal subgroups K and N such that G/K and G/N are 
abelian. If K 1 N = {e}, show that G is abelian. 


4.18. Let G have a normal subgroup NV. Show that the subgroups of G/N are pre- 
cisely of the form H/N, where H is a subgroup of G with N C H. Furthermore, 
show that H is normal in G if and only if H/N is normal in G/N. 


4.19. Let G be an abelian group. Show that the elements of finite order in G form a 
normal subgroup N, and that the only element of finite order in G/N is the identity. 


4.20. Let G be a nonabelian group. Show that there exists a subgroup H of G such 
that Z(G) CH CG. 
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4.3 Homomorphisms 


We would like to talk about functions from one group to another. But an arbitrary 
function is not necessarily very useful. We need it to respect the group operation. 
This is the first step towards our goal (realized in the next section) of describing a 
way of determining if two groups have the same structure. 


Definition 4.5. Let G and H be groups. Then a group homomorphism (or, simply, 
homomorphism) from G to H is a function a : G > H such that 


a(g1g2) = a(g1)a(g2) 
for all g1, g2 € G. 


Note that in the above definition, the product g;g> is the product in G, whereas 
the product w(g;)a@(g2) takes place in H. These group operations need not be the 
same. 


Definition 4.6. If a : G — H is a homomorphism, then the kernel of a is the set 
ker(a) = {g € G: a(g) =e}. 


Example 4.10. If n > 2 is a positive integer, then a : Z —> Z, given by a(a) = [a] 
(where we insert the equivalence class brackets for clarity) is a homomorphism. 
Indeed, a(a + b) = [a+ b] = [a] + [b] = a(a) + a(b), for all a,b € Z. Here, 
ker(a) = {a € Z: [a] = [0]} = nZ. 


Example 4.11. Let G be the additive group of integers, and let H be the multiplicative 
group of nonzero rational numbers. Then the function a : G — H given by a(a) = 
2° is ahomomorphism. To check this, we had first better verify that w does indeed 
map G into H. But if a is an integer, then 2° is a nonzero rational number. Also, 
a(a + b) = 24+ = 2°2” = a(a)a(b), as required. We see that ker(a) = {a € Z: 
2% = 1} = {0}. 


Example 4.12. Let G be any group, and consider the map a : G x G > G given 
by a((g1, 2)) = g2. Then a is ahomomorphism. Indeed, if g; € G, then 


ot ((81, 92)(g3, 84)) = &((8183, 8284) = 8284, 


and this is also equal to a((g1, g2))a((g3, g4)). Furthermore, the kernel is {(g, e) : 
geG}=G-x {el}. 


Example 4.13. If G and H are any groups, then a : G — H given by a(g) = e for 
all g € Gis a homomorphism. Indeed, a(g122) = e, and a(g1)a(g2) = e* = e. The 
kernel of @ is all of G. 


We can give a few basic properties of homomorphisms. 
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Theorem 4.10. Let a : G — H be a homomorphism, and take any g € G. Then 


1. a(e) =e€é; 
2. a(g”) = (a(g))", for any integer n; and 
3. if |g| =m < oo, then the order of a(g) divides m. 


Proof. (1) Note that 
a(e) = a(ee) = a(e)a(e). 


Cancelling, we find that a(e) is the identity of H. 
(2) If n > O, then note that 


a(g") = a(gg---g) = a(g)a(g)---a(g) = (a(g))”. 
—_—_—— —_—_——X——SXsa 
ntimes ntimes 


If n = 0, then use part (1). If m = —1, then note that 
a(g)a(gé') = a(gg”!) = ale) =e. 


Similarly, a(g!)a(g) = e. Therefore, a(g~!) = (a(g)) 1. Combining what we 
already know, the case where n < —1 follows immediately. 

(3) We have (a(g))” = a(g”) = a(e) = e. Thus, by Corollary 3.2, the order of 
a(g) divides m. 


The kernel of a homomorphism is rather important, as the following result sug- 
gests. 


Theorem 4.11. Let a: G — H be a homomorphism. Then 


I. ker(a) is anormal subgroup of G; and 
2. a is one-to-one if and only if ker(a) = {e}. 


Proof. Let K = ker(a). Let us show that K is a subgroup of G. By Theorem 4.10, 
a(e) = e, soe € K. Suppose ky, ky € K. Then a(kjko) = a(ki)a(k2) = ee = e; 
hence, kik € K. Also, a(k,') = (a(k))~! = e! = e. Thus, k;' € K, and K isa 
subgroup. If k € K and g € G, then 


a(g-'kg) =a(g ')a(k)a(g) = a(g')ea(g) = (a(g))'a(g) =e. 


Therefore, g-'kg € K, and K is normal. 

Now, suppose that @ is one-to-one. Since a(e) = e, we know that if a(g) = e, then 
g =e. Therefore, the kernel is simply {e}. Conversely, suppose that ker(@) = {e}. 
If a(g1) = a(gz), then at(g1)(a@(g2))~! = e. But this means that a(gigy') = e, and 
therefore £125. € K = {e}. That is, g; = go, and a is one-to-one. 


Two other sorts of subgroups are also useful. 
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Definition 4.7. Let a : G — H be a homomorphism. If L is any subgroup of G, 
then the image of L is a(L) = {a(/) : 1 € L}. If M is any subgroup of H, then the 
preimage (or inverse image) of M is the set a~!(M) = {g € G: a(g) € M}. 


Note that the use of the notation w~'(M) does not imply that the function o is 
invertible. It may or may not be. 


Example 4.14, Consider Example 4.11. If L = 3Z, then a(L) = {2° : a € Z} = 
(84: ae Z}.IfM = {+44 : a €Z}, thena!(M) = 22Z. 


Example 4.15. Let G = Z, and consider a : G x G > G, as in Example 4.12. Let 
L = 3Z x 5Z. Then a(L) = 5Z. If M = 62Z, thena—!(M) = Z x 62. 


We conclude with a few properties of images and preimages. 


Theorem 4.12. Let a: G — H be a homomorphism. Then 


if L is a subgroup of G, then a(L) is a subgroup of H; 

if L is normal in G, then a(L) is normal in a(G); 

if L is cyclic, then a(L) is cyclic; 

if L is abelian, then a(L) is abelian; 

a is onto if and only if a(G) = H; 

ifM <H, then a~!(M) < G; and 

if M is anormal subgroup of H, then a~'(M) is normal in G. 


NAMA WNS 


Proof. (1) We have e € L, soe = a(e) € a(L). If a(l;),a(h) € a(D), then 
a(])a (Iz) = a(Ilb) € a(L), since Ijly € L. Also, (a(1,))7! = a(l,') € a(L), since 
i eL. 

(2) Ifl e L, g € G, then (a(g))~!a(la(g) =a(g—'lg) € a(L), since g'Ig € L. 

(3) If L = (k), then for any a(/) € a(L), we have 1 = k’, for some integer m. 
Then a(/) = a(k’”) = (a(k))”. Thus, a(L) = (a(k)). 

(4) If ld), € L, then a(l))a(h) = ahh) = a(hl) = a(h)a(h). 

(5) This is the definition of “onto”. 

(6) Notice that a(e) = e € M; hence, e € a~!(M). Also, if g}, g2 € a7 '(M), 
then a(g1g2) = a(g1)a(g2) € M, since a(g1), a(g2) € M. Thus, gig. € a7'(M). 
Furthermore, a(g;') = (a(g1))~! € M, since a(g;) € M. Thus, g;' € a !(M). 

(7) Take a € a~'!(M), g € G. Then a(g—!ag) = (a(g))~'a(a)a(g) € M, since 
a(a) € M and M is normal. Thus, g~'ag € a~!(M). 


Exercises 


4.21. Are a and §, described below, homomorphisms? If so, are they one-to-one 
and onto? 


1. G is the group of positive real numbers under multiplication, H is R (under 
addition), a: G > H viaa(a) = logi,a 
2. B:Z—> Z, B(a)=at+1 
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4.22. Let a : Zo x Z27 > Zn7 be given by a((a, b)) = 30, for all a € Zo, b € Zy7. 
Show that a is ahomomorphism. Also, find ker(@), and decide if a is onto. 


4.23. Define w : U(16) x U(16) > U(16) via w((a, b)) = ab~!. Show that a is a 
homomorphism, and find a~!((7)). 


4.24. Describe every homomorphism @ : Zj9 > Zs. 


4.25. Let G be a finite group and a : G > H an onto homomorphism. 


1. If G has an element of order n, must H have one? 
2. If H has an element of order n, must G have one? 


4.26. Let a : G > H be a homomorphism, and suppose that a(g) = h. For any 
a € G, show that a(a) = hif and only if a = gk for some k € ker(a@). 


4.27. Definea : G x G > G viaa((g, h)) = gh. If a is a homomorphism, show 
that G is abelian. 


4.28. Show that a group G is cyclic if and only if there exists an onto homomorphism 
from Z to G. 


4.29. Let N be a normal subgroup of G. Show that there exist a group H and a 
homomorphism a@ : G — H with kernel NV. 


4.30. Let G be the multiplicative group of nonzero complex numbers and H the 
multiplicative group of nonzero real numbers. Does there exist a one-to-one homo- 
morphism from G to H? 


4.4 Isomorphisms 


One of our goals is to establish if two groups are, in effect, the same. To this end, we 
need to strengthen the notion of a homomorphism. 


Definition 4.8. Let G and H be groups. Then a group isomorphism (or, simply, 
isomorphism) from G to H is ahomomorphism from G to H that is bijective. When 
such an isomorphism exists, we say that G and H are isomorphic groups. 


Isomorphic groups have precisely the same structure. The isomorphism simply 
provides new labels for the group elements. 


Theorem 4.13. On any collection of groups, isomorphism is an equivalence 
relation. 
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Proof. Reflexivity: Use the function a : G —> G given by a(g) = g for all g. 
It is easily seen to be an isomorphism. Symmetry: Suppose that a : G — AH is 
an isomorphism. By Theorem 1.3, there exists a function 6 : H — G given by 
B(h) = g, where a(g) = h, and this £ is also bijective. We must check that it 
is a homomorphism. Take any /;,h2 € H, and suppose that 6(h;) = g;. Then 
a(g1g2) = a(gi)a(gs) = hihy; thus, B(hih2) = gig. = (1) (hz), as required. 
Transitivity: Suppose thata : G — H and 8 : H — K are isomorphisms. Let 
y = Boa. By Theorem 1.2, y is bijective. We must check that it is a homomorphism. 
Take any g1, go € G. Then 


¥(g182) = B(a(g182)) = B(a(g1)a(g2)) = B(a(g1)) B(a(g2)) = y(g1)v (g2). 


We are done. 


Therefore, it makes sense to say that G and H are isomorphic; we do not have to 
specify that G is isomorphic to H. In order to verify that a particular function is an 
isomorphism, we have to check three things: it must respect the group operation, it 
must be one-to-one and it must be onto. We can use Theorem 4.11 for the second of 
these; to show that it is one-to-one, it is enough to show that the kernel is trivial. 


Example 4.16. Let us show that Z3 x Zs and Zjs5 are isomorphic groups. We define 
a: Z\5 > Z3 x Zs viaa(a) = (a, a). First, is this well-defined? If a = b in Z)s, 
then 15|(a — b), so 3|(a — b) and 5|(a — b), and therefore (a, a) = (b, b) in Zz x Zs. 
Check that it is ahhomomorphism. If a, b € Z\5, thena(a +b) = (a+b,a+b)= 
(a, a) + (b, b) = a(a) + a(b). Next, let us show that it is one-to-one. If a € ker(a), 
then (a, a) = (0,0). That is, 3Ja and 5|a. Therefore, 15|a, soa = O in Z 15, and 
a is one-to-one. In this case, we do not need to check surjectivity, because the 15 
elements of Z;5 map to 15 different elements of Z3; x Zs. But Z3 x Zs only has 15 
elements! Hence, the function must be onto. 


Example 4.17. Lest we get too comfortable, Z24 is not isomorphic to Z4 x Ze. Why 
not? Notice that 1 has order 24 in Z,. If these groups had precisely the same structure, 
then Z4 x Z> would have to have an element of order 24 as well. But it is easy to 
see that 12(a, b) = (0, 0) for every (a,b) € Z4 x Ze, so every element has order 
dividing 12. 


Example 4.18. As we noted following Example3.4, the set G = {1,—1,i,—-i} 
(where i is the complex number) is a group under multiplication. We claim that 
it is isomorphic to the additive group Z4. To see this, we define a : Z, > G via 
a(0) = 1, a1) = i, a(2) = —1 and a(3) = —i. This function is clearly bijective, 
and we can check that it respects the group operations by comparing the group tables. 
The tables for Z4 and G are shown in Tables 4.4 and 4.5. Note that if we replace 0 
and a(0) with A, | and a(1) with B, and so on, we see that both groups have Table 
4.6. Thus, @ is just a relabelling of the group elements. 


In fact, we can classify all cyclic groups up to isomorphism. 
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Table 4.4 Group table for the additive group Z4 


=i} 


Theorem 4.14. Let G = (a) be acyclic group. If a has infinite order, then G is 
isomorphic to Z. If a has order n < 00, then G is isomorphic to Zn. 


Proof. Let G be infinite cyclic. Define a : Z — G via a(i) = a’. We claim that a 
is an isomorphism. If i,j € Z, then a(i+j) = a’ = aad! = a(i)a(j), as required. 
If i € ker(a), then a’ = e = a’. By Theorem3.8, i = 0. Thus, @ is one-to-one. 
Furthermore, if i ¢ Z, then a’ € a(Z), as a(i) = a’. Thus, a is onto as well, and 
therefore an isomorphism. 

Now suppose that |a| = n < oo. Definea : Z, > G viaa(i) = a’. Here, 
we must check that a is well-defined. But if i = 7 in Z,, then n|(i — j). Thus, by 
Theorem 3.8, a! = a’. The fact that a is an onto homomorphism follows as above. 
If i € ker(a), then a‘ = e, and by Corollary 3.2, n divides i. Thus, in Z,, i = 0. 


Corollary 4.2. [fa group G has prime order p, then G is isomorphic to Z,. 


Proof. Combine Corollary 3.6 and Theorem 4.14. 


So, groups of prime order have as nice a structure as we could ask. With a little 
more work, we can also classify the groups with order twice a prime. 


Lemma 4.1. Let G be a group having distinct commuting elements a and b of order 
2. Then G has a subgroup isomorphic to Zy x Zp. 
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Proof. Given the conditions upon a and b, we can see that H = {e,a,b, ab} isa 
subgroup. (It contains the identity, and closure is easily checked.) Also, H contains 
four distinct elements. (Clearly, e, a and b are distinct. If ab = e = bb, then a = b. 
If ab = a = ae, then b = e. If ab = b = eb, then a = e. These are all impossible.) 
We claim that it is isomorphic to Z. x Z). Leta : H > Z» x Zp be given by 
a(e) = (0,0), a(a) = (1,0), a(b) = (0, 1) and a(ab) = (1, 1). This function is 
clearly bijective, and running through the possible pairs of group elements, we see 
that it is ahomomorphism. 


Corollary 4.3. Every group G of order 4 is isomorphic to either Z4 or Zz x Zo. 


Proof. In a group of order 4, every nonidentity element has order 2 or 4. If there 
is an element of order 4, G is cyclic and, by Theorem 4.14, isomorphic to Zag. 
Otherwise, every nonidentity element has order 2. By Exercise 3.32, G is abelian, 
and the preceding lemma tells us that G has a subgroup isomorphic to Zz x Z. Given 
the order of the group, we are done. 


When p is a prime larger than 2, we are already aware of two possible groups of 
order 2p; the cyclic group and the dihedral one. In fact, those are all of the options. 


Theorem 4.15. Let |G| = 2p, where p is an odd prime. Then G is isomorphic to 
either Zp or Dop. 


Proof. The possible orders for nonidentity elements of G are 2, p and 2p. If G has 
an element of order 2p then it is cyclic and, by Theorem 4.14, isomorphic to Z2,. So, 
assume that every nonidentity element has order 2 or p. 

If every nonidentity element has order 2, then once again, G is abelian and, 
by Lemma 4.1, G has a subgroup of order 4, contradicting Lagrange’s theorem. 
Therefore, let a € G have order p. Take any b ¢ (a). Suppose that |b| = p. Then 
noting that (a) M (b) is a subgroup of both (a) and (b) (see Exercise 3.37), Lagrange’s 
theorem tells us that it can only have order | or p. As b ¢ (a), it must be 1. Thus, 
by Theorem 4.4, |(a)(b)| = |(a)||(b)|/|(a) O (b)| = p?/1 = p?. But this exceeds the 
order of G. Therefore, |b| = 2. 

Now, as (a) has index 2, Theorem 4.1 tells us that it is normal. Thus, b~'ab € (a), 
say b-'ab = a’. But then 


a = bab? = b"'!(b-'ab)b = ba‘b = (b“ab)' = (a')' = a". 


As a has order p, we have i? = 1 (mod p). That is, pi? -—-l)=@-1d+4+1). As 
p is prime, i = +1 (mod p). Thus, b-'ab = aora™!. 

Suppose that b~'ab = a. Then a and b commute. But consider the order of ab. 
If (ab)” = e, then a” = b™” € (a) (b) = {e}, as a and b have different prime 
orders. Thus, p|n and 2|n, so 2p|n. That is, ab has order 2p, which we have excluded. 
Therefore, b~'!ab = a™!. 
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We now know everything about the group. As (a) has index 2 and b ¢ (a), the 
elements of G are precisely a! and ba', 0 < i < p. Furthermore, we know how to 
find the product of any two elements. Indeed, a'a/ = a'*/ (reducing the exponent 
modulo p if necessary), ba‘a’ = ba'*/, 


a‘bal = b(b™'a‘b)a’ = b(b-ab)'a’ = b(a7!)'a' = ba 


and ba‘ba’ = b(ba’~') = a~'. Thus, we can fill in the entire group table for G, 
and we have precisely the same group structure as in the dihedral group! Indeed, 
letting F be any flip in Dz,, we define a : G + Dpyy via a(a') = R360i/p and 
a(ba') = FR360i/p. Then a is certainly bijective and it is a homomorphism as 
well. 


We can also mop up a proof we postponed. 


Theorem 4.16. [fm and n are relatively prime, then U (mn) is isomorphic to U (m) x 
U(n). 


Proof. Define a : U(mn) > U(m) x U(n) via a(a) = (a, a). If (a, mn) = 1, then 
(a, m) = (a,n) = 1, so we have (a, a) € U(m) x U(n) whenever a € U (mn). Let us 
verify that a is well-defined. But if a = b in Z,,, then mn|(a — b), so m|(a — b) and 
n|(a — b), and therefore (a, a) = (b, b) in U(m) x U(n). It is also a homomorphism; 
indeed, if a,b € U(mn), then a(ab) = (ab, ab) = (a, a)(b, b) = a(a)a(b). Let us 
check that a is one-to-one. But if a € ker(q), then (a, a) = (1, 1) in U(m) x U(n); 
that is, m|(a — 1) and n|(a — 1). As m and 7 are relatively prime, mn|(a — 1). That 
is, a = | in U(mn); hence, a@ is one-to-one. Finally, we must show that a is onto. 
Take any (c,d) € U(m) x U(n). By the Chinese Remainder Theorem, there exists 
an a such that a = c (mod m) and a = d (mod n). Furthermore, to show that a 
is in U(mn), it suffices to show that it is relatively prime to both m and n. Without 
loss of generality, suppose that (a,m) = k > 1. Then as k|a and k|m, we see that 
k|c as well. But then (c,m) 4 1, which is impossible. Therefore, a € U(mn) and 
a(a) = (c, d). Thus, @ is indeed an isomorphism. 


This gives us the second part of Theorem 3.19. 
Corollary 4.4. [fm and n are relatively prime, then y(mn) = g(m)g(n). 


Proof. The order of U(k) is p(k). As isomorphic groups have the same order, the 
preceding theorem completes the proof. 


We wish to add one more point to Corollary 4.3 and Theorem 4.15. If we are to 
classify groups of a particular order up to isomorphism, we had better ensure that 
the groups we have listed are not isomorphic to each other. Proving that two groups 
are not isomorphic generally involves finding a property that one has but the other 
lacks. For instance, Z, is cyclic for all primes p, but neither Z, x Zz nor Do, is 
cyclic (indeed, D2, is not even abelian). Some properties that can be useful follow. 
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Theorem 4.17. Let G and H be isomorphic groups. Then 


1. Gis abelian if and only if H is abelian; 

2. G is cyclic if and only if H is cyclic; 

3. |G| = |H]; 

4. for any positive integer n, G and H have the same number of elements of order 
n (which could be an infinite number); 

5. for any positive integer n, G and H have the same number of subgroups of order 
n (which could be an infinite number); and 

6. for any positive integer n, G and H have the same number of normal subgroups 
of order n (which could be an infinite number). 


Proof. Leta : G — H beanisomorphism. (1) Asa(G) = H, we see from Theorem 
4.12 that if G is abelian, so is H. But the same can be said for a~! : H > G. 

(2) Same idea. 

(3) An isomorphism is a bijection. 

(4) Take g € G of order n. By Theorem 4.10, |a(g)| divides |g]. But by the 
same argument, |g| = |a~!(a(g))| divides |a(g)|. Thus, |g| = |a(g)|. That is, the 
elements of order n in G are in one-to-one correspondence with the elements of order 
nin ll. 

(5) Let L be a subgroup of G of order n. Then a(ZL) is a subgroup of H, and it 
is isomorphic to L; hence, it has the same order. If M is some other subgroup of G, 
then since @ is one-to-one, a(M) is a different group. Thus, H has at least as many 
subgroups of order n as G does. But applying a~', we find that G has at least as 
many subgroups of order n as H does. 

(6) Let L be a normal subgroup of order n in G. Then by Theorem 4.12, a(ZL) is 
a normal subgroup of a(G) = H. Now proceed as in (5). 


Example 4.19. As U(10) = (3) is cyclic of order 4, we know that U (10) is isomor- 
phic to Z4. Now, U(8) is an abelian group of order 4, but it is not cyclic, so it is not 
isomorphic to U(10). By Corollary 4.3, U(8) is isomorphic to Zz x Zp. 


Example 4.20. Consider the groups U(20) and U(8) x U(3). Each is an abelian 
group of order 8 and neither is cyclic; however, they are not isomorphic. To see this, 
note that U (20) has exactly three elements of order 2; namely, 9, 11 and 19. However, 
U(8) x U(3) has too many elements of order 2; in fact, all seven nonidentity elements 
have that order. 


Exercises 


4.31. For each of the following pairs of groups, explain why they are not isomorphic. 


1. Z4 x Z4 and Z4 x Zo x Zo 
2. GL,(R) and R 
3. ZandZx Z@ 


4.32. For each of the following pairs of groups, explain why they are or are not 
isomorphic. 
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: Zo X Zz and Z3 x Z3 x Zs 
: Z\ and Z3 x Z7 

. U(22) and Zyo 

: Do and Zo x Zi0 


BRWNe 


4.33. Let G be the set of all matrices of the form ¢ i) for all integers a. Show 


that G is a subgroup of GL2(R). To what familiar group is it isomorphic? 
4.34. Show that Z is not isomorphic to Q. 

4.35. Let H < G anda € G. Show that H and a~'Ha are isomorphic. 
4.36. Show that G x H is isomorphic to H x G. 

4.37. Show that Z is isomorphic to a proper subgroup of itself. 


4.38. Let G be any group. Let H consist of the same set of elements as G, but with 
a new operation given by a * b = ba, for all a and b. Show that H is a group, and 
that it is isomorphic to G. 


4.39. Consider the group G from Exercise3.42. Show that it is isomorphic to a 
proper subgroup of itself. 


4.40. Consider the group H from Exercise 3.42. Show that it is isomorphic to the 
multiplicative group of positive rational numbers. 


4.5 The Isomorphism Theorems for Groups 


In this section, we will discuss three theorems that can aid us in showing that certain 
groups are isomorphic. The first of these theorems is the most important, and is used 
to prove the other two. 


Theorem 4.18 (First Isomorphism Theorem for Groups). Let a: G — H bea 
homomorphism. Then G/ ker(a@) is isomorphic to a(G). 


Proof. Let K = ker(a). We know that K is a normal subgroup of G. Define £6 : 
G/K — a(G) via B(aK) = a(a). We claim that f is an isomorphism. 

First, we must show that f is well-defined. Suppose that aK = bK. Thena™'b € 
K, and therefore a(a~'b) = e. That is, (a(a))~!a(b) = e, so a(a) = a(b). Thus, B 
is well-defined. 

Also, 6 is ahomomorphism. Indeed, 


B(aKbK) = B(abK) = a(ab) = a(a)a(b) = B(aK)B(bK). 


Next, let us check that 6 is one-to-one. Suppose that aK € ker(6). Then a(a) = e, 
which means that a € K, so aK = eK. That is, ker(8) = {eK}, and 6 is one-to-one. 
Finally, we must verify that 6 is onto. Take a(a) € a(G). Then B(aK) = a(a). 
We are done. 
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The First Isomorphism Theorem is a crucial tool in proving that groups are iso- 
morphic. It is also an enormous time-saver! Whenever we are asked to show that 
something along the lines of G/N is isomorphic to H, all we need to do is find a 
homomorphism from G onto H with kernel NV. We do not need to define a function 
on cosets and check that it is well-defined. 


Example 4.21. For any integer n > 2, Z/nZ is isomorphic to Z,. Indeed, define 
a: Z — Z, via a(a) = [a] (where we insert the equivalence class brackets for 
clarity). This is ahomomorphism, as a(a+ b) = [a+b] = [a]+[b] = a(a)+a(d), 
for alla, b € Z. Also ker(a) = {a € Z: a =0 (mod n)} = nZ. Finally, if [a] € Zp, 
then w(a) = [a], so a@ is onto. The First Isomorphism Theorem completes the proof. 


Example 4.22. We claim that GL2(R)/SL2(R) is isomorphic to the multiplicative 
group of nonzero real numbers, which we denote by H. Indeed, definew : GL,(R) > 
H via a(A) = det(A). As an invertible matrix has a nonzero determinant, the image 
of GL2(R) is indeed contained in H. Also, if A, B € GL2(R), then a(AB) = det(AB) 
and since determinants respect products, this is det(A) det(B) = a(A)a(B). Thus, a 
is ahomomorphism. By definition, its kernel is SL2(R). Finally, if a €¢ H, then 


«((ci)) = 


and therefore a is onto. Now we apply the First Isomorphism Theorem. 


Example 4.23. Let us show that if G and H are any groups, then G x H has a factor 
group isomorphic to G. Define aw : G x H —> G viaa((g,h)) = g, forall g € G, 
h € H. Check that a is a homomorphism. If g; € G, h; € H, then 


a((g1, 41) (g2, h2)) = a((g1g2, hih2)) = gigo = a((g1, h1))a((g2, h2)). 


Also, if g € G, then a((g,e)) = g, soa is onto. Therefore, (G x H)/ker(@) 
is isomorphic to H. If we wish to specify the group being factored out, note that 
ker(a) = {(e,h): he H}={e} x A. 


Theorem 4.19 (Second Isomorphism Theorem for Groups). Let G be a group 
with H and N subgroups, such that N is normal. Then H /(H 1 N) is isomorphic to 
HN /N. 


Proof. We will show that H 1 N is normal in H by demonstrating that it is the 
kernel of a homomorphism. Also, by Theorem 4.5, HN is a subgroup of G, since 
N is normal. Define a : H — HN/N via a(h) = AN. As H C AHN, we see 
that hN € HN/N. Observe that a is a homomorphism. Indeed, if 1,4. € H, 
then a(hjh2) = hyhoaN = (hiN)(yN) = a(hy)a(h2). Also, if hn € HN, then 
a(h) = hN = hnN, since h-'hn = n € N. Thus, a is onto. Finally, ker(w) = {h € 
H:hN =eN} ={heH:heN} =ANN. The First Isomorphism Theorem 
finishes the proof. 
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Theorem 4.20 (Third Isomorphism Theorem for Groups). Let G be a group and 
suppose that N and K are normal subgroups, with K C N. Then (G/K)/(N/K) is 
isomorphic to G/N. 


Proof. Definea : G/K — G/N viaa(aK) = aN, for anya € G. First, let us check 
that this is well-defined. But if aK = bK, then a~'b € K C N, soaN = DN. Next, 
let us show that a is ahomomorphism. But 

a((aK)(bK)) = a(abK) = abN = (aN)(bN) = a(aK)a(bK). 


Furthermore, if aN € G/N, then a(aK) = aN, so a is onto. Finally, 


ker(a) = {aK € G/K : aN = eN} = {aK € G/K: aE NJ=N/K. 


We now apply the First Isomorphism Theorem. 


Example 4.24. The Third Isomorphism Theorem tells us that (Z/12Z)/(4Z/12Z) 
is isomorphic to Z/4Z. (Admittedly, we could have worked this out by noting that 
Z is cyclic, so its factor group is cyclic, and the factor group of the factor group is 
cyclic, and that every cyclic group of order 4 is isomorphic to Z4, which in turn is 
isomorphic to Z/4Z. But isn’t this faster?) 


Exercises 


4.41. LetG = Z x Zand N = {(a,a) : a € Z}. Show that G/N is isomorphic to 
Zi. 


4.42. For any groups G and H, show that (G x H)/(G x {e}) is isomorphic to H. 


4.43. Show that R/Z is isomorphic to the multiplicative group H = {a+ bie C: 
a® + b* = 1}. 


4.44, Let G be an abelian group and n a positive integer. Consider the groups H 
and K from Exercise 3.40. Show that G/H is isomorphic to K. 


4.45. Let G be the group from Exercise 3.16. 


1. Find Z(G). 
2. Show that G/Z(G) is isomorphic to Z x Z. 


4.46. Let G be a group having subgroups WN and K of index 2, such that N 4 K. 


1. Show that [V : NM K] = 2. 
2. Show that G/(N M K) is isomorphic to Zz x Zo. 
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4.6 Automorphisms 


One particular type of isomorphism deserves special mention. 


Definition 4.9. Let G be any group. Then an automorphism of G is an isomorphism 
a :G-— G. The set of all automorphisms of G is called the automorphism group 
of G, and is denoted Aut(G). 


Example 4.25. Let G be any abelian group. Then the function a : G — G given by 
a(a) = a! for all a € G is an automorphism. To see that a is a homomorphism, 
note that a(ab) = (ab)~'! = a~'b~! = a(a)a(b). (Note that this would not work if 
G were nonabelian, as (ab)~! = b~'a7'!) If a(a) = e, then a~! = e, soa = e and 
a is one-to-one. Also, if a € G, then a(a~!) = a. Thus, a is onto as well. 


Theorem 4.21. For any group G, the automorphism group of G is a group under 
composition of functions. 


Proof. As we noted in Theorem 4.13, the composition of two isomorphisms is an 
isomorphism; therefore, the same follows for automorphisms, so Aut(G) is closed. 
By Theorem 1.2, the composition of functions is always associative. Certainly, the 
identity function that fixes every element of G is an automorphism, and serves as an 
identity for Aut(G). Finally, we saw in Theorem 4.13 that every isomorphism has 
an inverse isomorphism; thus, each automorphism has an inverse. 


Generally speaking, determining Aut(G) for an arbitrary group G is a difficult 
problem. But we can, at least, solve it when G is cyclic. 


Theorem 4.22. Let G = (a) be acyclic group. Then 


1. ifa has infinite order, then Aut(G) is isomorphic to Zz; and 
2. if |a| =n < oo, then Aut(G) is isomorphic to U(n). 


Proof. Let a € Aut(G). If a(a) = a’, then for every j € Z, we have a(a/) = 
(a(a)y = (a'y. In particular, G = a(G) = (a'). Thus, a’ must generate G. Con- 
versely, suppose that G = (a’), and a(a) = a’. Then we can only have a(a@/) = a! 
for all integers 7. We claim such an @ is an automorphism. Indeed, 


a(da’) = a(a’**) = ai'Vt® = alia* = a(a/)a(a*), 


so @ is a homomorphism. If a(a@’) = e, then (a') = e. If a has infinite order, then 
ij = 0, and therefore j = 0. If |a| = n < ov, then n|ij. But as |a| = |a'| = |G|, 
Corollary 3.2 tells us that (n, i) = 1. This means that n|j, so a’ = e. Either way, 
ker(a) = {e}. As a’ is a generator, it follows immediately that a is onto. The claim 
is proved. Thus, the automorphisms of G are precisely given by a(a’) = a'/, where 
a’ is a fixed generator of G. 

If a has infinite order, then the only generators of (a) are a and a~!. Indeed, if 
a™ were a generator, then we would have to have a = (a’)', for some / € Z. But 
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then ml = 1, which means that m € {1, —1}. It is clear, on the other hand, that both 
a and a7! are generators. Thus, Aut(G) has order 2. By Corollary 4.2, Aut(G) is 
isomorphic to Zp. 

Now suppose that |a| =n < oo. Let us define y : Aut(G) > U(n) via y(@) = 1, 
where a(a) = a’. Again, Corollary 3.2 tells us that since |a| = |a'| = |G|, we have 
(i,n) = 1, soi € U(n). Now, if a, B € Aut(G), with w(a) = a! and B(a) = a’, then 


(a o B)(a) = a(B(a)) = a(a’) = (a(a)y = al. 


Thus, y(a@ o 6) = ij (reducing modulo n if necessary). But y(a)y(B) = ij as well, 
so y is ahomomorphism. If y(~) = 1, then a(a) = a, and hence a@ is the identity 
automorphism. Therefore, ker(y) is trivial. Finally, if i ¢ U(m), then as we have 
observed, a! is a generator of G, and we obtain a € Aut(G) such that a(a) = a’. 
Therefore, y (a) = i, and y is onto. Hence, y is the isomorphism we seek. 


In particular, we see that the automorphism group of a cyclic group is abelian. 
It would be a mistake to think that the automorphism group of an abelian group is 
necessarily abelian, as the following example indicates. 


Example 4.26. Let G = Zy x Zp. Then define a : G > G via a((0, 0)) = (0, 0), 
a((1,0)) = (0, 1), w((O, 1)) = C1, 0) and a((1, 1)) = C1, 1). Also, let 6((0, 0)) = 
(0,0), 6(,0)) = (1,0), BQO, 1) = C, 1) and BCCI, 1)) = (0, 1). Clearly, a 
and 6 are both bijective. The group is also small enough that one can check all of 
the possibilities and find that they are homomorphisms. Therefore, a, 6 € Aut(G). 
But a(B((1, 0))) = a((1, 0)) = (0, 1), whereas B(a((1, 0))) = B(O, 1) = C1, D. 
Thus, a o B £ B oa, so Aut(G) is nonabelian. In Exercise 4.49, we must show that 
Aut(G) is isomorphic to De. 


Let us define a particular type of automorphism. 


Definition 4.10. Let G be a group and a ¢€ G. Then the inner automorphism 
induced by a is 0, : G > G given by 6,(g) = a-'ga for all g € G. The inner 
automorphism group of G is Inn(G) = {@, : a € G}. 


Inner automorphisms are only interesting when the group G is nonabelian; for 
abelian groups, every inner automorphism is the identity function, as a~!ga = g. 
Let us list a few basic properties of inner automorphisms. 


Lemma 4.2. Let G be a group and a,b € G. Then 


1. 64 € Aut(G); 
2: 6, ° Op = Ona? and 
3. (Og)7! = 04-1. 


Proof. (1) First, let us show that 6, is a homomorphism. If g, i € G, then 


6,(gh) = a7! gha = (a~' ga)(a~ ha) = 04(g)6a(h). 
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If 6,(g) = e, then a~'!ga = e, so g = aea™! = e; thus, ker(6,) = fe}, and @, is 
one-to-one. Finally, if g € G, then 6,(aga~') = a'aga”'a = g; thus, 6, is onto. 
(2) If g € G, then 6,(0;,(g)) = O4(b-' gb) = a7'b7! gha = Opa(g). Thus, 6,065 = 
Oba. 
(3) If g € G, then 6,,(6,-1(g)) = 0,(aga~!) = a~!aga~'a = g; thus, 0, 0 6,-1 is 
the identity function. Similarly, so is 6,-1 0 Og. 


Theorem 4.23. For any group G, Inn(G) is anormal subgroup of Aut(G). 


Proof. By the preceding lemma, Inn(G) C Aut(G). Certainly 6. € Inn(G) is the 
identity automorphism. Also, the preceding lemma shows that Inn(G) is closed 
under composition and the taking of inverses. Therefore, Inn(G) < Aut(G). To 
show normality, take wa €¢ Aut(G) and 6, € Inn(G). Then 


(a! 06,0 a)(g) = a | (04(a(g))) 
= a—!(a~!e(g)a) 
=a!(aq!)a~'(a(g))a7 | (a) 
= (a '(a))"'ga7'(a) 
= Ou-1(a) (8); 


for all g € G. That is,a~!06,0a= Oy-!(a) € Inn(G), and Inn(G) is normal. 


It is certainly possible for Aut(G) to be larger than G; indeed, Example 4.26 
provides such a group. But there is only one inner automorphism for each group 
element. However, 6, does not have to be different from 6, if a € b. For instance, if 
a and b are both central, then 6, and 6; are both equal to the identity automorphism. 
The following theorem tells the tale. 


Theorem 4.24. Let G be a group. Then 


1. ifa,b € G, then 0, = > if and only if ba! € Z(G); and 
2. G/Z(G) is isomorphic to Inn(G). 


Proof. (1) Take any a,b € G. Then 6, = 0, if and only if a~'ga = b~'gb for all 
g € G. But this occurs if and only if ba~'!g = gba“! for all g € G. In other words, 
if and only if ba~! is central. 

(2) Define a : G > Inn(G) via a(a) = 6,-1. Let us show that a is a homomor- 
phism. If a, b € G, then 


a(ab) = Oap)-1 = Op-1q-1 = Og-1 0 1 = a(a)a(b), 


making use of Lemma 4.2. Also, if 6, € Inn(G), then w(a~') = @,, so a is onto. 
Furthermore, a € ker(a) if and only if 6, = 6. By (1), this happens if and only if 
a = ae! € Z(G). Now apply the First Isomorphism Theorem. 


Example 4.27. As the centre of $3 is trivial, we see that Inn(S3) is isomorphic to $3. 
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Example 4.28. As Z(Dg) = (Rigo), the distinct elements of Inn(Dg) are of the form 
6,, where we take one a for each left coset of (Rigg) in Dg. That is, Inn(Dg) = 
{9p > Ro» OF, , OF, }- In particular, it is a group of order 4, so by Corollary 4.3, it is 
isomorphic to either Z4 or Zy x Zp. But every flip in Dg already has order 2, and the 
square of every rotation is in Z(Dg). Therefore, we see that there is no element of 
order 4 in Dg/Z(Dg); thus, it must be isomorphic to Z x Zp. 


Exercises 


4.47. Let G be an abelian group of order n, and let m be a positive integer relatively 
prime to n. Show that a : G > G given by a(a) = a” is an automorphism of G. 


4.48. Let G be a group with automorphism @ and H a group with automorphism 
B. Show that y : Gx H — Gx H given by y((g,h)) = (a(g), B(A)) is an 
automorphism. 


4.49. Show that the automorphism group of Z2 x Zp is isomorphic to De. 


4.50. Let G and H be isomorphic groups. Show that their automorphism groups are 
also isomorphic. 


4.51. Let a be an automorphism of G. Show that {a € G : a(a) = a} is a subgroup 
of G. 


4.52. Let a and 6 be any two automorphisms of G. Show that {a € G : a(a) = 
B(a)} is a subgroup of G. 


4.53. For any group G, an automorphism a of G is said to be a power automorphism 
if a(H) C H for every subgroup H of G. If G = (a) x (b) is the direct product 
of two cyclic groups, and @ is a power automorphism of G, show that there exists a 
k € Z such that a(g) = g* forall g € G. 


4.54. To what familiar group is the inner automorphism group of D)2 isomorphic? 


4.55. Let a be an automorphism of Q. Show that for every g € Q, we have a(qg) = 
qa(1). 
4.56. Let G be a group such that the automorphism group of G is trivial. 


1. Show that G is abelian. 
2. Show that a” = e for every a € G. 


Chapter 5 ®) 
Direct Products and the Classification ectics 
of Finite Abelian Groups 


We can now determine the structure of finite abelian groups. In particular, every such 
group is isomorphic to a direct product of cyclic groups, each having prime power 
order. The proof of this result is our main goal in the present chapter. 


5.1 Direct Products 


We defined the direct product of two groups in Definition 3.3. There is no particular 
reason that we need to restrict ourselves to two. 


Definition 5.1. Let G,,..., G, be any groups. Then the (external) direct product 
G, x G2 x --- x G, is the Cartesian product of the groups G; under the operation 
(aj, ...,Ax)(b),..., bp) = (aby, ..., agbx), for all a;, b; € G;. (We allow k = 1 
here, in which case G = Gj.) 


Theorem 5.1. [f G,,..., Gx are groups, then G, x --- X Gx is a group. 


Proof. The proof is essentially identical to that of Theorem 3.1. 


The reason we used the word “external” in the above definition is that the groups G; 
are not subgroups of the direct product; indeed, they are not even subsets. However, 
G;, is, for instance, isomorphic in a natural way to G; x {e} x --- x {e}, which is a 
subgroup of the direct product. What we would like is a way of showing that a group 
is isomorphic to the direct product of certain subgroups. To this end, let us consider 
the following. 


Definition 5.2. Let G be a group, and let Ni,..., Nx be subgroups of G. Then we 
say that G is the internal direct product of N,..., Nx if 
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1. each N; is normal; 
2: N\N2- -+ Np = G; and 
3. foreachi, 1 <i <k, we have (N,N2---N;) O Nia: = {e}. 


(Again, we allow k = 1, in which case G = Nj.) 


In particular, G is the internal direct product of normal subgroups Ny and N» if 
and only if Nj Nz = G and N,N N2 = {e}. 


Example 5.1. Let G = Zo, Ni = (4) and Nz = (5). As Gis abelian, every subgroup 
isnormal. Also, N; = {0, 4, 8, 12, 16} and Nz = {0, 5, 10, 15}. Thus, N;NN2 = {0}. 
For each a € G, we could find n; € N; and nz € N> such that a = n, + n> but, in 
fact, we can avoid this by noting that | Nj + N2| = |Ni||N2|/|N10N2| = 5-4/1 = 20 
(see Theorem 4.4). Thus, VN; + N2 = G, and G is the internal direct product of N, 
and N>. 


Note that if there are more than two groups, then we need to check more than just 
that each N; 1 N; = {e} for the final part of the definition. 


Example 5.2. Let G = Z39, N; = (15), No = (10) and N3 = (6). Again, normality 
is not an issue. It is easy to see that Nj N Nz = {0}. Thus, |N; + N2| = |Ni||N2| = 2- 
3 = 6. As every element of Nj + N2 is in (5), we see immediately that N; + N2 = (5). 
But now we observe that (VN; + N2) 1 N3 = {0}. Then the same argument shows 
that |N; + N2 + N3| = 30, and we know that N; + Nz + N3 = G. Therefore, G is 
the internal direct product of N;, Nz and N3. 


Let us see how internal direct products behave. Here are some highly useful facts. 


Lemma 5.1. Let G be a group with normal subgroups K and N. If K 1 N = {e}, 
thenkn = nk forallke K,neé N. 


Proof. Leth = (nk)~!(kn) = k7'!n7'kn. As K isnormal,n~'kn € K,soh € K.As 
N isnormal, k~!n-'k € N,soh € N. Since KNN = {e}, we have (nk)~! (kn) = e, 
and therefore kn = nk, as required. 


Lemma 5.2. If G is the internal direct product of N\,..., Nx, then every element 
of G can be written in exactly one way as ninz---ng, with each n; € Nj. 


Proof. Since G = N,--- Nx, we know that every element of G can be written in 
such a way. We only need to show uniqueness. Our proof is by induction on k. If 
k = 1, there is nothing to do, as G = N,. Assume that k > | and the result holds 
for groups written as an internal direct product of a smaller number of subgroups. 
Suppose that n; ---mp_jng = hy - + hy_y hg, with n;, h; € N;. Then 


hyn) = (hy ++ Agi) | (my ++ me—1) € Ne O(N + Ne-1) = fe}. 
Therefore, ny = hz, and we have ny ---ng_) = hy ---hy_y in N| No --- Ng_1, which 


is an internal direct product of k — 1 subgroups. By our inductive hypothesis, n; = h; 
for all i. 
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Example 5.3. As we saw in Example 5.2, Z39 is the internal direct product of (15), 
(10) and (6). Note, for instance, that 23 = 15+ 20+ 18. By the above lemma, there 
is no other way to write 23 as a sum of elements in (15), (10) and (6). 


And now, the big reason why we are interested in these internal direct products. 


Theorem 5.2. Let G be a group, and suppose that it is the internal direct product of 


normal subgroups N,,..., Ng. Then G is isomorphic to the external direct product 
Ni KPI Nx. 
Proof. Define a : Nj x +--+ x Ny > G viaa((n,...,2,)) = ny---ny. We claim 


that a is an isomorphism. In view of Lemma 5.2, a is bijective. Thus, it remains to 
show that it is ahomomorphism. Take n;, h; € N;. Then 


a((my,...,m)(Ay,..., hg) =a((njhy, ..., nghy)) = ny hynzhzn3h3 ---nghg. 


As N,; and N, are normal subgroups, and N; 1 Ny = {e}, Lemma 5.1 says that 
hyn2 => nohy. Thus, 


nyhynohon3h3 mene nyhg = nynohyhon3h3 pues nyhg. 


By Theorem 4.5, N, N> is anormal subgroup of G, and we know that (N,N) N3 = 
{e}. Therefore, hjh2n3 = n3h,h2. We now have 


nyhynzhon3h3 mee nyhg => nynon3hyhoh3ngh4 eee nhg. 
Repeating this procedure, we find that 


a((ny,..-,mg)(Aq,..-,4g)) = nyng---nghyhg-+-hey =a((nq,..-,K))a((Aq, -.-, AK))- 


Thus, a is ahomomorphism, and the proof is complete. 


As a result of this theorem, we will engage in a small abuse of notation and write 
G = N,; x No x--- x Ny when G is the internal direct product of Ni,..., Nx, as 
well as for the external direct product. 


Example 5.4. By Example 5.2, Z39 = (15) x (10) x (6). 


Example 5.5. We claim that U(8) = (3) x (7). As the group is abelian, normality is 
not an issue. Also, |3| = |7| = 2, so the intersection of these cyclic subgroups must 
be trivial. Furthermore, 1 = 1-1,3 = 3-1,7 = 1-7 and5 = 3-7, so U(8) = (3)(5) 
(or just use an order argument). Thus, we have an internal direct product. 


Exercises 


5.1. Write U (32) as the internal direct product of two proper subgroups. 
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5.2. LetG = H x K.Ifh € H has order m and k € K has order n, what is the 
order of (h, k)? 


5.3. How many elements of order 5 are there in Zs x Z25? How many elements of 
order 25? 


5.4. How many cyclic subgroups of order 5 are there in Zs x Z25? How many cyclic 
subgroups of order 25? 


5.5. Show that Dg is not the internal direct product of two proper subgroups. 


5.6. Let |a| = 4 and |b| = 2. Write (a) x (b) as the internal direct product of two 
proper subgroups in every possible way. 


5.7. Show that in Definition 5.2, it is not sufficient to replace the third condition 
with the stipulation that N; 0 N; = {e} whenever i 4 j. In particular, find a group 
G with normal subgroups N,, Nz and N3 such that Nj N2N3 = G and Nj; 1N; = {e} 
whenever i ~ j, butG # N; x No x N3. 


5.8. Let G = (a) be cyclic of order 84. Show that G = (a!*) x (a?!) x (a?8). 


5.9. Suppose that G = N; x Np is an internal direct product. If a: G > H is an 
onto homomorphism, does it follow that H = a(N,) x a(N>)? Prove that it does, 
or give an explicit counterexample. 


5.10. Let G bea group having finite normal subgroups Nj, ..., Nx, such that the gcd 
of |N;| and |N;| is 1 whenever i 4 j. Show that Nj N2--- Ny = Ni x No x--++x Nx. 


5.2 The Fundamental Theorem of Finite Abelian Groups 


Let us now classify the finite abelian groups. We will break our proof down into 
stages. For the first stage, we need a definition. 


Definition 5.3. Let p be a prime number. Furthermore, let G be a group anda € G. 
We say that a is a p-element if the order of a is p” for some integer n > 0. If every 
element of G is a p-element, then G is a p-group. 


Example 5.6. The dihedral group Dg is a 2-group, as every element has order 1, 2 
or 4. On the other hand, Z4 is not a p-group. Indeed, 12 and 18 are both 2-elements 
and 8 is a 3-element, so it cannot be a p-group. In fact, 1 is not a p-element, for any 
prime p. 


Lemma 5.3. Let p be a prime and G an abelian group. Then the p-elements of G 
form a subgroup. 
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Proof. Let H be the set of all p-elements of G. As e has order p®, we have e € H. 
Let a,b € H. Then say that |a| = p” and |b| = p”™. Let k be the larger of m and 
n. Then as G is abelian, (ab)? = a? b? = e? = e, as |a| and |b| both divide p*. 
Thus, |ab| divides p*, and therefore ab € H. Finally, if a € H, then |a| = |a~'|, so 
a~' € H. Thus, H is indeed a subgroup of G. 


Note that the preceding lemma does not work for nonabelian groups. Indeed, in $3, 
123 123 : 123 
we can see that (; 1 ;) and G 3 >) both have order 2, but their product, (; 3 a 
has order 3. 
The following result is also very handy. 


Lemma 5.4. Let G be any group and lete 4a € G be such that a has finite order. 
Thena = a" qa" .-- a" for some integers n,, ..., x, where eacha”™ is a p;-element, 
for some prime p; dividing |a\|. 


Proof. Our proof is by induction on the number of distinct primes, /, dividing |a]. If 
1 = 1, thena isa p-element, so just letn; = 1. Suppose that / > 1 and that the result 
is true for smaller values of /. Let p be a prime dividing |a|, and say that |a| = p’"q, 
with (p,q) = 1. By Corollary 2.1, there exist u,v € Z such that p”u + qv = 1. 
Then 


m m 


Ceo Sar Sa har 


Now, (a7)?" = a?"4 = e;hence, a’ isa p-element and so is (a%)”. So, let p; = pand 
n, = qv. Similarly, the order of (a?")" divides q, and qg has fewer primes dividing 
it than |a|. Thus, by our inductive hypothesis, a?" can be written as a product of 
powers (which are also powers of a) in the manner stated in the theorem. The proof 
is complete. 


We can now simplify our task by breaking a finite abelian group down into a direct 
product of p-groups. 


Lemma 5.5. Let G be a nontrivial finite abelian group, and let p,,..., pg be the 
distinct primes dividing |G|. Then G = H, x Hp x--- x Hy, where H,; is the subgroup 
of G consisting of all of the p;-elements of G. 


Proof. Lemma 5.3 tells us that the H; are subgroups and, as G is abelian, we do not 
have to worry about normality. Let us show that G = H, H2--- Hy. But taking any 
a € G, we see from Lemma 5.4 that a can be written as a product of elements from 
various Hj. (If a = e, there is obviously nothing to worry about.) Finally, we must 
show that for eachi, 1 <i < k, we have (H, --- H;) 1 Hj4; = {e}. But suppose that 
a € Hj, and, simultaneously, a = a, ---a;, witha; € H;. Then letting |a;| = ys 
and m = pj''---p;", we have a” = a'"---a!", and since each |a;| divides m, 
we conclude that a” = e. Thus, |a| divides m. But also, a is a p;+1-element. As 
(m, pi+i1) = 1, the only possible conclusion is that a = e, and we have an internal 


direct product. 
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We can now focus our attention on finite abelian p-groups. The following lemma 
does the biggest part of the work. It is the most difficult proof we have encountered 
so far, and will take some time to absorb. 


Lemma 5.6. Let G be a finite abelian p-group, and let a € G be an element of 
largest possible order. Then G = (a) x H, for some subgroup H of G. 


Proof. Our proof is by strong induction on |G]. If |G| = 1, then a = e and using 
H = (e) will work. So, assume that |G| > 1 and that the lemma holds for groups of 
smaller order. 

Let |a| = p”, with n a positive integer. If (a) = G, then we can use H = (e), 
so assume that (a) #4 G. Take b € G such that b ¢ (a). As b is a p-element, we 
know that b” =e € (a), for some positive integer k. Let m be the smallest positive 
integer such that b?” € (a), and letc = b?"'. Thenc ¢ (a), butc? = b?” € (a). In 
particular, let us say that c? = a', withi € Z. 

Now, as G is a p-group, and the largest element order is p”, we must have 
cP?" = e. Thus, |c?| divides p”~!. Suppose that (p,i) = 1. Then by Corollary 3.2, 
|c?| = |a'| = p", which is impossible. Thus, p divides i; let us say that i = pj. 
Then let d = a~/c. Note that a/ € (a); thus, if d € (a), then c = a/d € (a), which 
is a contradiction. Therefore, d ¢ (a). However, d? = a~/?c? = (a')“'c? = e; 
thus, |d| = p. 

Now, let us consider the group M = G/(d). (As G is abelian, we do not have to 
worry about (d) being normal.) We note that M is still abelian (by Theorem 4.7), its 
order is [G : (d)] = |G|/p and it is a p-group with the orders of elements dividing 
orders of elements of G (by Theorem 4.7). Also, we claim that |a(d)| = p”. As 
its order must divide p”, suppose that a?” ' € (d). Since a?” ' # e, we must have 
a?” ' =d5,withO <s < p. But then (s, p) = 1, so by Corollary 2.1, there exist 
u,v € Z such that su + pv = 1. Thus, d = d+?" = (d’)"(d?)" = a?" “e € (a), 
giving us a contradiction. Therefore, |a(d)| = p”, as claimed. 

It now follows that a(d) is an element of largest order in M. As M is an abelian 
p-group of smaller order than G, our inductive hypothesis tells us that there is a 
subgroup K of M such that M = N x K, where N is the subgroup of M generated 
by a(d). By Theorem 4.8, K = H/(d), where H is a subgroup of G containing (d). 

We claim that G = (a) x H. Normality is not an issue. Suppose that a’ € (a)NH. 
Then ai (d) € NOK, andas the product N x K is direct, this means that a! (d) = e(d). 
But we demonstrated above that the order of a(d) is p”, which means that p” divides 
i, and therefore a‘ = e. Thus, (a) N H = fe}. 

Now, take any g € G. Then as M = N x K, we have g(d) = xy, for some 
x EN,y € K.Letus write x = a‘(d) and y = w(d), witht € Z and w € H. Then 
g =a'wd', for some/ € Z. Asa! € (a) and wd! € H, we now see that (a)H = G. 
Thus, we have the required direct product, and our proof is complete. 


And now, the payoff for our hard work! 


Theorem 5.3 (Fundamental Theorem of Finite Abelian Groups). Let G be a 
finite abelian group. Then G is the direct product of subgroups, H, x --- x Hy, with 
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each Hj cyclic of order p;', where the p; are (not necessarily distinct) primes, and 
the n; are nonnegative integers. 


Proof. If G is the trivial group, there is nothing to do. Otherwise, by Lemma 5.5, G 
is the direct product of p-subgroups. Therefore, we may as well assume that G is a 
finite abelian p-group. Our proof is by strong induction on |G|. If |G| = 1, again, 
there is nothing to do, so let G be nontrivial and suppose that our theorem holds for 
groups of smaller order. Let a be an element of largest possible order in G. Then by 
Lemma 5.6, G = (a) x H, for some subgroup H. But then |H| = |G|/|a|, so H 
has smaller order, and by our inductive hypothesis, H is a direct product of cyclic 
groups of prime power order. However, (a) is also a cyclic group of prime power 
order, and we are done. 


We can express this slightly differently. 


Corollary 5.1. Let G be a nontrivial finite abelian group. Then G is isomorphic to 


Lyn x Ze Kes Zyr, where the p; are some (not necessarily distinct) primes, 


and the nj are positive integers. 


Proof. Combine Theorems 5.2 and 5.3 with Theorem 4.14. 


Example 5.7. Up to isomorphism, the abelian groups of order 16 are Z16, Zg x Zo, 
Z4 x Z4, Za x Zo x Zo and Z» x Zo x Zo x Zo. 


Example 5.8. Note that U (32) is an abelian group of order g(32) = 16, so it must be 
isomorphic to one of the groups in the preceding example. But which one? Examining 
the orders of the elements, we find that there is no element of order 16, so it is not 
Z16. However, |3| = 8. As none of the other groups in the preceding example have 
an element of order 8, U(32) is isomorphic to Zg x Zp. 


Example 5.9. As 200 = 235°, the finite abelian groups of order 200 are all isomor- 
phic to one of the following, namely Zg x Zy5, Z4 x Zi x Zins, Z2 X LZ X Zy x D5, 
Zg xX Zs x Zs, Zag X Zn x Zs xX Zs and Zy xX Zo x Zo xX Zs x Zs. 


We might be momentarily concerned about the absence of Zy99 in the preceding 
example. However, it is isomorphic to Zg x Zos, as the following theorem shows us. 


Theorem 5.4. Let G = H, x --- x Hy, where each H; is cyclic of order n;. Then 
G is cyclic if and only if (n;,nj) = 1 whenever i ¥ j. 


Proof. Let H; = (a;). Ifthe n; are all relatively prime, then we claim that (a), ... , ax) 
has order n; ---ngz = |G|, and therefore G is cyclic. Suppose that (a),...,a%)” = 
(e,..., e). Then each a” = e, so n;|m. As the n; are relatively prime, n, ---n|m, 


by Corollary 2.3. Since |G| = n,---ng, the largest possible order of an element is 
nj, +--+, and the claim is proved. 

On the other hand, suppose that the n; are not relatively prime. Without loss of 
generality, say that some prime p divides bothn, and n2. Then for any r; € Z, we have 
(aj',...,ajsyrm/P = (e,..., e), since each n; divides ny, ---ny/p. (For i = 1, we 
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have n)(n2/p)n3---nz, and fori > 2, we have (n;/p)n2n3---nx.) Thus, every 
element of G has order dividing n,n. ---n,/p, and therefore there is no element of 
order |G|, so G is not cyclic. 


As a result of our classification, we can prove a special case of a famous result 
due to Augustin-Louis Cauchy. 


Theorem 5.5 (Cauchy’s Theorem for Abelian Groups). Let G be a finite abelian 
group, and suppose that p is a prime dividing |G|. Then G has an element of order 


Pp. 


Proof. If |G| is divisible by a prime, then G is not the trivial group. Letting G be as 
in Corollary 5.1, we see that |G| = p' p5* --- p;‘. If p divides |G], then p = pj, for 
some i. But then G has a subgroup isomorphic to Zp», for some n; > 0. However, 


in Zp«, the element p”—! has order p. The proof is complete. 


Corollary 5.2. A finite abelian p-group has order p", for some n > 0. 


Proof. Let G be a finite abelian p-group. If the corollary is false, then the order of 
G is divisible by g, for some prime q 4 p. But then G has an element of order g, 
which is impossible. 


Exercises 


5.11. Give a list of abelian groups of each of the following orders, such that every 
abelian group of that order is isomorphic to one of the groups in the list. 


1. 21 
2. 81 
3. 9800 


5.12. Give a list of abelian groups of each of the following orders, such that every 
abelian group of that order is isomorphic to one of the groups in the list. 


1. 144 
2. 243 
3. 55125 


5.13. Write U(56) as an external direct product of cyclic groups of prime power 
order, as in Corollary 5.1. 


5.14. Write (Zoo x Ze)/((10, 2)) as an external direct product of cyclic groups of 
prime power order, as in Corollary 5.1. 


5.15. Let p be a prime. Suppose that G is a nontrivial finite abelian group in which 
every element has order | or p. Show that G is isomorphic to a group of the form 
Zp X Zp X +++ X Zp. 


5.2 The Fundamental Theorem of Finite Abelian Groups 93 


5.16. Suppose that n is an integer that is a product of distinct primes. If G is a finite 
abelian group, and |G| is divisible by n, show that G has a cyclic subgroup of order 
n. 


5.17. If (a) is acyclic group of order 35, write a as the product of a 5-element and 
a 7-element. 


5.18. If (a) is a cyclic group of order 90, write a as the product of p-elements, for 
various primes p. 


5.19. Prove Theorem 5.5 in a different way, as follows. Let p be a prime dividing 
|G|. Show that G has an element a of some prime order, say g. If g = p, we are 
done. Otherwise, what can be said about G/(a)? Complete the proof. 


5.20. Let G be a finite abelian group and let n be a positive integer dividing |G]. 
Show that G has a subgroup of order n. 


5.3 Elementary Divisors and Invariant Factors 


For any positive integer 1, we now know all possible abelian groups of order n, up to 
isomorphism. Indeed, we determine the prime factorization of n, and then proceed 
as in Examples 5.7 and 5.9. But we have not yet made certain that the groups we 
found are not isomorphic to each other. Let us work on that. 


Definition 5.4. Let G be a nontrivial finite abelian group, and say that G = Hy x 
Hy x +++ x Hy, where each H; is cyclic of order p;", for some prime p; and positive 
integer n;. Then the elementary divisors of G are the numbers pj’, p3’,..., Pz‘; 
where the order in this list is irrelevant, but each number must be listed as many 


times as it occurs. The trivial group has no elementary divisors. 
Example 5.10. The elementary divisors of Zo x Zo x Z3 x Zy25 are 9,9, 3, 125. 


Example 5.11. To find the elementary divisors of Z399 x Z3, we use Theorem 5.4 to 
see that the group is isomorphic to Z25 x Z4 x Zz x Zs, so the elementary divisors 
are 4, 3, 3, 25. 


Definition 5.5. Let G be an abelian group and n a positive integer. Then we write 
G" = {a":aeG}. 


Lemma 5.7. Let G and H be abelian groups and n a positive integer. Then 


I. G" is a subgroup of G; and. 
2. ifa: G — FH is an onto homomorphism, then a(G") = H". 


Proof. (1) See Exercise 3.40. 

(2) If g” € G”, then a(g”) = (a(g))” € H”. Also, if h” € H”, then as a is onto, 
write h = a(g), with g € G. Then h” = (a(g))” = a(g”) € a(G"), completing the 
proof. 
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The elementary divisors are very important, as they uniquely determine a finite 
abelian group, up to isomorphism. 


Theorem 5.6. Let G and H be finite abelian groups. Then G and H are isomorphic 
if and only if they have the same elementary divisors. 


Proof. If G and H have the same elementary divisors, then each is isomorphic to 
a direct product of cyclic groups, and the groups appearing in the direct product in 
G have the same order as those appearing in H, so they are isomorphic. (We must 
be a bit careful, as the cyclic groups may appear in a different order in the direct 
product, but M x N is always isomorphic to N x M, so this is not a problem. See 
Exercise 4.36.) Note that if neither G nor H has any elementary divisors, then each 
is the trivial group, so they are isomorphic. 

On the other hand, let a : G — H be an isomorphism. Take any prime p. Now, 
by Lemma 5.3, the p-elements of G form a subgroup, as do those of H. Furthermore, 
as isomorphisms preserve the orders of group elements, w provides an isomorphism 
from one of these p-subgroups to the other. As the elementary divisors come from 
these p-subgroups, we may as well assume to begin with that G and A are both 
p-groups. We proceed by strong induction on |G|. If |G| = 1, then G and H are both 
the trivial group, so neither has elementary divisors. Therefore, assume that |G| > 1 
and the result holds for groups of smaller order. 

In particular, say G = G, x --- x G; and H = H, x--- x H), where G; = (g;) 
is cyclic of order p”, and H; = (h;) is cyclic of order p”'. Rearranging the terms if 
necessary, we may assume thatn, > ny >--- > ny > Oandm, > m2, >--->m,> 
0. By the above lemma, a(G?) = H?. Thus, a(G? x --» x G?) = H? x---x H?. 
But G? = (g?), and since |g;| = p”, we have |g?| = p™—', by Corollary 3.2. 
Similarly, |A?| = p”—'. Thus, G? is a p-group of strictly smaller order than G, and 
by our inductive hypothesis, the elementary divisors of G? and H? are the same. 


But the elementary divisors of G? are p"™~', p"~',..., p”~', where n, > 1 but 
n, = 1 whenever u > r. (When n, = 1, we have p™—! = 1, which does not count 
as an elementary divisor. If n,; = 1, then G” has no elementary divisors.) Similarly, 
the elementary divisors of H? are p™7~',..., p"—!, where m, > 1 but m, = 1 


whenever v > s. Therefore, r = s and m; — 1 = n; — 1 wheneveri < r. But then 
m; = n;,foralli < r. Also,n; = 1 foralli > r andm; = 1 foralli > s. In order to 
prove that G and H have the same elementary divisors, it remains only to show that 
k =1. But |G| = p™ --- p" p*" and |H| = p™--- p”™ p'-". As isomorphic groups 
have the same order, p*~" = p'~", and therefore k = 1. If G? has no elementary 
divisors, then neither does H”, and we simply get p* = p', hence k = 1. 


Example 5.12. The five abelian groups of order 16 listed in Example 5.7 are all non- 
isomorphic, as they have different elementary divisors. Similarly for the six abelian 
groups of order 200 given in Example 5.9. 


Example 5.13. Let G = Zno9 X Zg X Ze, H = Zy2 x Zio X Za X Zo and K = 
Z5 X Zo4 X Zg X Zp. These are all abelian groups of order 9600. However, using 
Theorem 5.4, we see that G is isomorphic to Zg x Zo5 x Zg x Z3 x Zo, so its 
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elementary divisors are 8, 8, 2,3, 25. Similarly, H is isomorphic to Z3 x Zg x Zs x 
Zs X Za X Zag X Zz, 8o its elementary divisors are 8, 4, 2, 2, 3, 5, 5 and K is isomorphic 
to Zo5 x Z3 x Zg x Zg x Zo, so its elementary divisors are 8, 8, 2, 3, 25. Therefore, 
G and K are isomorphic, but H is not isomorphic to either of them. 


There is another interesting way to express a finite abelian group as a direct product 
of cyclic groups. 


Theorem 5.7 (Invariant Factor Decomposition). Suppose that G is a nontrivial 
finite abelian group. Then G = H, x Hp x --- X Hy, where each H; is a cyclic 
subgroup of G of order m;, with m, > 1 and m;|mj+4\, for 1 <i <k. 


Proof. We will explain how to construct the H;, assuming that G has been expressed 
as a direct product of cyclic groups of prime power order, as in coenay 5.1. Let 
Pi,---, p, be the primes dividing |G|. For each j, find the largest power P; ’ such that 


Ny nz 


Z, si appears in Corollary 5.1. Letting my = p;' p,”--- p?’, Theorem 5. 4 says that 


H. = Z ph ere x Zr is isomorphic to Zm,. Now, delete all of the terms from the 
direct aa in Corollary 5.1 that we have used Cuan only one copy, if multiple 
copies of the same group appear). For each j, let p; j_ be the largest power appearing 
in the remaining terms (where s; = 0 is entirely possible). Let my_, = pj! --- py". 
By construction, each s; < nj, so my_;|m,y. Again, Ay; = ZL yi KX Lyx is 
isomorphic to Z,,,_,. Delete all of these terms that we have just used, and repeat until 
we exhaust the entire direct product in Corollary 5.1. 


Definition 5.6. If G is isomorphic to Z,,, x --- * Zm,, where m, > 1 and m;|mi+1, 
for 1 <i < k, then the numbers m,..., m, are called the invariant factors of G. 


Example 5.14, Letus use our work in Example 5.9 to find the invariant factors of the 
abelian groups of order 200. We apply the method from Theorem 5.7. Considering 
Za X Zz x Zs, we see that the highest power of 2 that appears is 4, and the highest 
power of 5 is 25. Therefore, m, = 4-25 = 100. Deleting Z,4 and Zs, we are left with 
Zo, SO Mg_| = 2, and we are finished. Thus, our group is isomorphic to Zz x Zi 0, 
so the invariant factors are 2, 100. When we examine Z x Z» x Zo x Zs x Zs, we 
see that m, = 2-5 = 10. Deleting Z, and Zs, we are left with Z, x Z) x Zs. Thus, 
mg; = 2-5 = 10. Deleting Zz and Zs, we are left only with Z). Thus, m,z_2 = 2, 
and we are finished. Therefore, our group is isomorphic to Z2 x Zio x Zo, which 
gives invariant factors of 2, 10, 10. Considering Zg x Z2s, we simply get Z299, so 200 
is the only invariant factor. Looking at Z2 x Zz x Zz x Zos5, we have Zz x Zp x Zs0, 
so the invariant factors are 2, 2,50. When we examine Zg x Zs x Zs, we obtain 
Zs X Zag, So the invariant factors are 5, 40. Finally, if we take Z4 x Zo x Zs x Zs, 
then we get Zio x Z9, so the invariant factors are 10, 20. 


In the above example, the nonisomorphic groups produced different lists of invari- 
ant factors. As it turns out, this always happens. 


Theorem 5.8. Let G and H be nontrivial finite abelian groups. Then G and H are 
isomorphic if and only if they have the same invariant factors. 
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Proof. Let G be isomorphic to Z,, X-++ xX Zm, Withm, > Landm;|mjii,1 <i <k. 
Similarly, write H as Z,, x +--+ x Zy,, withn; > 1 andn;|njqi, 1 <i <1. If G and 
H have the same invariant factors, then they are both isomorphic to the same direct 
product, and therefore to each other. 

On the other hand, suppose that G and H are isomorphic. We will show that they 
have the same invariant factors. Our proof is by strong induction on |G|. If |G| = 2, 
then the only possible invariant factor list is 2 for both G and H, so there is nothing 
to do. Assume that |G| > 2 and that the result is true for groups of smaller order. If 
we take (g1,..., 8x) € G, then each g; has order dividing m,, and therefore all g; 
have order dividing m;. On the other hand (0, 0,..., 0, 1) has order mz. Thus, mx 
is the largest possible order of an element of G. Similarly, n; is the largest possible 
order of any element of H. Therefore, as isomorphisms preserve orders of group 
elements, m, = n,;. Now, expressing each m; as a product of prime powers, we note 
that the elementary divisors of G are those that come from Z,,, x --- x Zm,_, together 
with those from Z,,,. Similarly, the elementary divisors of H are those coming from 
Zn, X +++ X Zy,_, together with those from Z,, = Z,,. As G and H are isomorphic, 
Theorem 5.6 tells us that they have the same elementary divisors. Deleting those from 
Zm,, the groups Zn, X ++: X Zm,_, and Zn, X +--+ X Zy,_, have the same elementary 
divisors. Thus, by Theorem 5.6, these groups are isomorphic. As they have smaller 
order than G, our inductive hypothesis tells us that k — 1 = 7 — 1 and each m; = nj. 
Therefore, the invariant factors are identical. 

(We have to be a bit careful if either k = 1 or / = 1, as then we have nothing left 
when we remove the term Z,,, or Z,,. But in this case, comparing orders, we must 
have k = / = 1, and the only invariant factor is m, for both groups.) 


Exercises 


5.21. Find the elementary divisors for each of the following groups. 


1. Zaz x Zar00 
2 Ze x Zig x Zs4 


5.22. Find the invariant factors for each of the following groups. 


1. Z3 X Z3 x Ly X Zos5 x ZX Zy21 
2: Za X Zg xX Zg X Zio xX Zs X Los X Lay 


5.23. Let p,q andr be distinct primes. Give the list of elementary divisors for every 
possible abelian group of order p*g7r. 


5.24. Let p, q and r be distinct primes. Give the list of invariant factors for every 
possible abelian group of order p*g7r. 


5.25. For which positive integers n are all abelian groups of order n isomorphic? 


5.26. Find the smallest positive integer n such that there are exactly four noniso- 
morphic abelian groups of order n. 
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5.27. Let G,, G2 and G3 be finite abelian groups, and suppose that G; x G2 is 
isomorphic to G; x G3. Show that Gz and G3 are isomorphic. 


5.28. Let a finite abelian group G have invariant factors n,,12,...,,. What are 
the invariant factors of G x G? 


5.29. Let G be a nontrivial finite abelian 2-group. Show that the number of elements 
of order 2 in G is 2* — 1, for some positive integer k. 


5.30. Let G be a finite abelian group. Suppose that, for every n € N, there are at 
most 1 elements a € G satisfying a” = e. Show that G is cyclic. 


5.4 A Word About Infinite Abelian Groups 


Unfortunately, that word is “messy”. We have seen that finite abelian groups behave 
very nicely. To be sure, we cannot possibly expect every infinite abelian group to be 
a direct product of cyclic groups of prime power order. But even if we allow direct 
products of infinite cyclic groups such as Z x Z, that does not come close to covering 
all of the possibilities. While a deep discussion of infinite abelian groups is beyond 
the scope of an introductory abstract algebra course, we can make a few remarks. 


Definition 5.7. Let G be a nontrivial group. We say that G is decomposable if it is 
the direct product of two proper subgroups. If not, then it is indecomposable. 


We can easily classify the indecomposable finite abelian groups. 


Theorem 5.9. Let G be a finite abelian group. Then G is indecomposable if and 
only if G is acyclic group of order p", for some prime p and positive integer n. 


Proof. In view of Theorem 5.3, an indecomposable finite abelian group must indeed 
be cyclic of prime power order. If G is cyclic of order p”, then suppose that G = 
H x K, for some subgroups H and K. Then by Lagrange’s theorem, H and K 
are both p-groups. Furthermore, by Theorem 3.16, they are both cyclic. But since 
G = H x K is cyclic, it follows from Theorem 5.4 that (|H|,|K|) = 1. As the 
orders are both powers of p, this means that either H or K is trivial, so either K or 
His all of G. Thus, H and K are not both proper and G is indecomposable. 


What about infinite abelian groups? 


Example 5.15. The additive group Q is indecomposable. Indeed, suppose that Q = 
H x K, where H and K are proper subgroups. Then neither H nor K is {0}, so 
take a/b € H, c/d € K, where a, b, c and d are nonzero integers. Note that 
bc(a/b) = ac € H andad(c/d) = ac € K. Then HN K is not trivial, so we do not 
have a direct product. Also, Q is not cyclic. Indeed, if a, b € Zand b > 0, it is clear 
that 1/(b + 1) ¢ (a/b). Thus, Q F (a/b). 
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Now, every element of Q other than the identity has infinite order. What about 
infinite abelian groups where every element has finite order? 


Example 5.16. Consider the group Q/Z. Exercise 5.31 asks us to examine some 
properties of this group. In particular, the distinct elements of the group are precisely 
of the form q + Z, where g € Qand0 < q < 1. Also, every element has finite order. 
But this group is decomposable. Indeed, fix any prime p. Then let H = {a/b+Z: 
a,béZ,b= p",n => O}and K = {c/d+Z: c,d € Z, (d, p) = 1}. In Exercise 
5.32, we also demonstrate that Q/Z = H x K. 


The group H from the preceding example is named for E.P. Heinz Priifer. 


Definition 5.8. Let p be a prime. Then the Priifer p-group is the subgroup {a/p” + 
Z:a,n € Z,n => 0} of the additive group Q/Z. 


Example 5.17. Let H be the Priifer p-group. We note that H is an abelian p-group; 
indeed, p"(a/p"+Z) =a+Z=0+2Z,; thus, the order of a/p" + Z divides p”. But 
H is not cyclic; indeed, 1/p” + Z has order p”, so H has elements of arbitrarily large 
order. So if it were cyclic, what order could its generator possibly have? However, 
Exercise 5.36 asks us to show that every nontrivial subgroup of H contains 1/p+ Z. 
Thus, H is surely indecomposable. 


In fact, Q and the Priifer p-group share another interesting property. 


Definition 5.9. Let G be an abelian group written additively. We say that G is 
divisible if, for every element a of G and every positive integer n, there exists a 
b € G such that nb = a. 


Note that if G is a nontrivial finite abelian group, then it cannot be divisible. 
Indeed, if G has order n, then nb = 0 for every b € G. Thus, if 0 #4 a € G, then 
nb = a has no solution. So, we must look to infinite abelian groups. 


Example 5.18. The group Q is divisible. Indeed, if a € Q and n is a positive integer, 
then n(a/n) =a. 


Example 5.19. For any prime p, the Priifer p-group is divisible. Indeed, to see this, 
we note that if G is divisible, so is any factor group of G. (See Exercise 5.35.) Thus, 
Q/Z is divisible. As in Example 5.16, write Q/Z = H x K, where H is the Priifer 
p-group. If a € H, then by the divisibility of Q/Z, for any positive integer n, there 
existh € H,k € K such that n(h, k) = (a, 0). But then nh = a. 


Exercises 


5.31. Let G = Q/Z. 


1. Show that the elements of G can be uniquely written in the form g + Z, where 
q € Qand0 <q <1. 
2. Ifa,b € Z, b > 0 and (a, b) = 1, what is the order of a/b + Z in G? 
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5.32. Show that for any prime p, Q/Z = H x K, where H is the Priifer p-group 
and K = {c/d+Z:c,d €Z, (d, p) = \}. 


5.33. Let G be a divisible group, written additively. Show that for every positive 
integer n, the function a : G > G given by a(a) = na is an onto homomorphism. 
Is it necessarily an automorphism? 


5.34. Let G and H be abelian groups, written additively. Show that G x H is 
divisible if and only if G and H are both divisible. 


5.35. Show that if G is a divisible group, then every factor group of G is divisible, 
but subgroups need not be. 


5.36. Let G be the Priifer p-group, for some prime p. Show that every nontrivial 
subgroup of G contains 1/p + Z. 


5.37. Let G be an abelian group having a subgroup N such that G/N is infinite 
cyclic. Show that G has a subgroup H such that H is infinite cyclicandG = Hx N. 


5.38. For any prime p, show that every proper subgroup of the Priifer p-group is 
finite. 


Chapter 6 Mm) 
Symmetric and Alternating Groups cro 


We have seen the definition of the symmetric group S,,, but so far, we do not have too 
much experience with it. In this chapter, we will introduce the notions of cycles and, 
in particular, transpositions, which are important elements of the symmetric group. 
These will help us to understand the group. 

We will also construct a subgroup of the symmetric group called the alternating 
group. If n > 5, then the alternating group is very special in that it has no nontrivial 
proper normal subgroups. 


6.1 The Symmetric Group and Cycle Notation 


Let n be a positive integer. Then we recall that the set of permutations of the set 
{1,2,...,} is a group of order n! under composition of functions. It is called the 
symmetric group and denoted S,,. Why is this group of sufficient interest to merit a 
chapter on its own? In the earliest years of group theory, the abstract definition of a 
group had not been written down. Instead, mathematicians worked with groups of 
permutations. As it turns out, they were not losing much by doing so! If A is any 
nonempty set, write P(A) for the set of all permutations of A. Then just as we saw 
that S, is a group under composition of functions, so is P(A). The following famous 
result is due to Arthur Cayley. 


Theorem 6.1 (Cayley’s Theorem). Let G be any group. Then G is isomorphic to 
a subgroup of P(G). 


Proof. For each a ¢€ G, define pg : G — G via pa(g) = ag, for all g € G. We 
claim that p, € P(G). Certainly p,(g) € G. If pa(gi) = Pa(g2), for g1, g2 € G, 
then ag) = ago, SO g} = go. Thus, p, is one-to-one. If g € G, then p,(a~'g) = g, 
SO Pq is also onto. The claim is proved. 
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Now, define p : G > P(G) via p(a) = Pq. We claim that p is ahomomorphism. 
If a,b € G, then p(ab)(g) = Pas(g) = abg and (p(a) © p(b))(g) = Pa(n(g)) = 
Pal(bg) = abg, forall g € G. Thus, p(ab) = p(a)o p(b), proving the claim. Also, if 
a € ker(p), then p, is the identity permutation. In particular, p,(e) = e, and therefore 
ae = e. Thus, a = e, and 9 is one-to-one. It now follows that G is isomorphic to 
pe(G), which is a subgroup of P(G). oO 


Corollary 6.1. Let G be a group of order n < oo. Then G is isomorphic to a 
subgroup of Sy. 


Proof. We know that G is isomorphic to a subgroup of P(G), but replacing G with 
{1,2,..., } is just a relabelling. Thus, G is isomorphic to a subgroup of S),. Oo 


The notation we have been using for elements of S, is rather cumbersome and 
tends to hide the properties of the permutations. It is time to introduce something 
better. 


Definition 6.1. Let k be a positive integer. A permutation o € S, is called a k-cycle 


if there exist distinct elements a), d2,..., ax € {1,2,...,} such that o(a;) = aj+1, 
forl| <i <k,o(a,) =a, andifa ¢ {a,,..., ay}, then o(a) = a. We use the cycle 
notation o = (a; a2 --- az). A cycle means a k-cycle for some k. 


: 12345 
Example 6.1, Let us work in Ss. Then o = 6 5324 
o(5) = 4, o(4) = 2 and everything else is fixed, we have o = (2 5 4). Note that it 
would be just as correct to write o = (5 4 2) or (4 2 5) (but not (2 4 5)). Similarly, 
T= ; i.) satisfies T(1) = 3, t(3) = 2, T(2) = 5, t(5) = 4, T(4) = 1, and 
there are no other values to consider, so tT is the 5-cycle (1 3 25 4) (or 3 25 4 1), 
and so on). 


is a 3-cycle; as 0 (2) = 5, 


Note that the only 1-cycle in S, is the identity permutation, denoted (1). 


Theorem 6.2. Any k-cycle in S,, has order k. 


Proof. Simply note thatifo = (a, --- ax), thenao(a)) = ap, o*(a}) = 0 (a2) = 4, 
and so on. It takes k steps to reach a; again. Similarly for all other q;. oO 
Definition 6.2. We say that cycles o;,..., 0, are disjoint if, whenever o;(a) 4 a, 


we have o;(a) = a forall j Ai. Ifo € S, and we write 0 = 0,02 ---o,, where the 
o; are disjoint cycles, then we have a disjoint cycle decomposition for o. 

: : : 7 : in Ss. Then noting that 0(1) = 5, 0(5) = 4 
and o(4) = 1, we have a cycle (1 5 4). Also, o(2) = 6 and o(6) = 2, so we 
have another cycle (2 6). The remaining number, 3, is fixed by o, so a disjoint cycle 
decomposition for o iso = (15 4)(2 6). 


Example 6.2. Leto = 
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12345678 
21568437 
dure as above, we find thato = (1 2)(3 5 8 7)(4 6) is a disjoint cycle decomposition. 


Example 6.3. Similarly, consider 0 = ( ) Using the same proce- 


In fact, we can always apply the procedure from the last two examples. 
Theorem 6.3. Every element of S;, is a product of disjoint cycles. 


Proof. Take any o € S,. If o is the identity, then o = (1) and there is nothing 
to do. Assume otherwise, and take a; € {1,...,} such that o(a,) = a2 4 a). If 
o (a2) = aj, then we have a 2-cycle, (a; az). Otherwise, let o (a2) = a3. Continue 
until we find a, such that o(a,) € {a,,..., ax}. Now, if o(a,) = a;, with 1 <i <k, 
then o*(a,) = o'~!(a,). Thus, o*-'+!(a,) = ay. In other words, ay_j42 = a). But 
this is a contradiction. Therefore, o (a,) = a;, and we have ak-cycle, (a; dz --- ax). 

If o = (aq a2 --- a,x), then we are done. Otherwise, take b; which is not in 
{a1,..., ax} such that o(b,) = by  b. Now repeat the same procedure, obtaining 
an I-cycle, (b; bz --- b;). We must make sure that these cycles are disjoint; that 
is, we cannot have b,, € {a),..., ax}, for any m. By choice, b; ¢ {a),..., dy}. If 
by = o(b,) = a,, then since a; = o(a;), for some s, we have ao (b,) = o(a;), and as 
o is one-to-one, b; = a;, which is impossible. Proceeding in this way, we see that 
the cycles are disjoint. 


Ifo = (a --- a)(h) --- by), then we are done. Otherwise, take any c, that 
does not lie in {a,,..., ax, bj, ..., by} such that o(c,) ~ c; and repeat. As there are 
only n entries in {1,..., m}, this procedure must stop eventually. Oo 


We were not too concerned about the order in which we wrote the cycles in the 
last proof. But this is ok. 


Theorem 6.4. Jn S,, disjoint cycles commute. 


Proof. Leto = (a, --- a) andt = (bj --- by) be disjoint cycles. We will 
show that ot = to. Takec € {1,...,n}. If c € {a,,...,a,}, then as o and T are 
disjoint, t fixes c. Thus, ot(c) = o(c). But o(c) € {a),..., ax} as well. Thus, t 
fixes o(c) too, so ta(c) = a(c). By a similar argument, if c € {bj,..., b,}, then 
ot(c) = to(c) = T(c). If c is not among the a; or b;, then both o and Tt fix c, so 
ot(c) = to(c) = c. We are done. oO 


Example 6.4. It makes no difference if we write (1 5)(2 6 4) or (2 6 4)(1 5). Both 
are the same permutation. 


However, it would be wrong to try to extend this to cycles that are not disjoint! 


Example 6.5. In S3,leto = (1 2) andt = (1 3). Letus compute or. Now, t(1) = 3 
and 0 (3) = 3, so ot (1) = 3. Also, t(3) = 1 and o(1) = 2, soort(3) = 2. Finally, 
t(2) = 2 and o(2) = 1, soot(2) = 1. There are no other values to consider, so oT 
is the 3-cycle (1 3 2). But proceeding in the same way, we find that to is a different 
3-cycle, (1 2 3). 
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Example 6.6. Letus finda disjoint cycle decomposition for (2 4)(2 5 3 4)(1 3)(1 5). 
We see (working from right to left) that 1 is mapped by (1 5) to 5, which is fixed by 
(1 3), which then goes to 3, which is fixed by (2 4). So, 1 goes to 3. Next, 3 is fixed 
by (1 5), then goes to 1, which is fixed by the other cycles, so we have a 2-cycle 
(1 3). Next, 2 is fixed by (1 3)(1 5), it then goes to 5, which is fixed by (2 4), so 
2 goes to 5. Now, 5 goes to 1 which goes to 3 which goes to 4 and then back to 2. 
Thus, we have another 2-cycle, (2 5). Finally, 4 goes to 2 then back to 4, so 4 is 
fixed. Therefore, we have (2 4)(25 3 4)(1 3)(1 5) = C1 3)(2 5). 


We can use the disjoint cycle decomposition to find the order of a permutation. 
Recall that the least common multiple of positive integers a), d2,...,a, is the 
smallest positive integer m such that a;|m for all 7. 


Theorem 6.5. If o,,...,0, are disjoint cycles in S,, then the order of o,--+- 0, is 
the least common multiple of the lengths of the o;. 


Proof. Let k be a positive integer. Then since the o; commute, by Theorem 6.4, we 


have (0; ---0,)k = of tee a. As the o; move disjoint subsets of {1, ..., 2}, we have 
of oe ok = (1) if and only if each of = (1). In view of Theorem 6.2, this occurs if 
and only if the length of each o; divides k. Oo 
Exercises 


6.1. Write each of the following permutations as a product of disjoint cycles. 
I 1234567 
“\1457326 


> 12345678 
"\25631487 


6.2. Write each of the following permutations as a product of disjoint cycles. 


1. (132) 4)(25 3) 
2. (2534)126)354)0 27) 


6.3. Find the inverse of each of the following permutations. Write the answer as a 
product of disjoint cycles. 


1. 224)8576) 
2. (1 2)(2 4 3)(23 5) 


6.4. Find all possible orders of elements of $7. 
6.5. How many elements of order 3 are there in So? 


6.6. Let o be a k-cycle. If m is a positive integer, show that 0” is a k-cycle if and 
only if (k,m) = 1. 


6.7. Leto € S, be a k-cycle. Show that there exists a k-cycle t € S, such that 
t* =o if and only if k is odd. 
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6.8. Ifn # 2, show that Z(S,) = {(1)}. 


6.9. Find the smallest positive integers m and n such that S,,, has an element of order 
105 and S, has an element of order 125. 


6.10. Find a subgroup of order 120 in Sg. 


6.2 Transpositions and the Alternating Group 


While a disjoint cycle decomposition gives us the clearest picture of the action of a 
permutation, it is often useful to write the permutation as a different sort of product. 


Definition 6.3. A transposition is a 2-cycle. 
Theorem 6.6. [fn > 2, then every permutation in S, is a product of transpositions. 


Proof. In view of Theorem 6.3, it is sufficient to show that every cycle is a product of 
transpositions. The identity is (1) = (1 2)(1 2). Let us take a k-cycle o; without loss 
of generality, sayo = (123 --- k). Weclaim thato = (1 k)(1 (k — 1))--- C1 2). 
Our proof is by induction on k. If k = 2, there is nothing to do. Otherwise, assume 
that(1 2 ---k) = (1k) (K—1))---C 2). Then 


(k+ DAK 0 2)=0 E+))02 + 


and performing the calculation, we see that this is (1 2 --- (k + 1)), as required. O 


Example 6.7. Let us write (1 4 5)(1 3 6 4 5) as a product of transpositions. Using 
the method described in the above proof, 


d45=da5d4) 


and 
(13645)=(15)0 4)(1 6) 3), 


so 
245)13645)=(50d 4d 5d 4d 6)C 3). 


It is worth noting that the expression of a permutation as a product of transpositions 
is by no means unique. For instance, we have seen that (1 2 3 4) = (1 4)(1 3)C1 2). 
But also, (1 2 3 4) = (1 2)(2 3)G 4). In fact, the number of transpositions involved 
does not have to be the same, as both of these are equal to (5 6)(1 2)(2 3) 4)(5 6). 

Nevertheless, we note that all of the products we have just calculated involve 
an odd number of transpositions. It is a very useful fact that this parity is always 
preserved; that is, a permutation will be a product of either an even or an odd number 
of transpositions, not both. The following lemma does most of the work in proving 
this fact. 
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Lemma 6.1. Jn S,, the identity permutation cannot be written as a product of an 
odd number of transpositions. 


Proof. Suppose that the lemma is false, and let k be the smallest odd number such 
that (1) = o,02--- 0%, where each o; is a transposition. Now, choose an element of 
{1,...,} that is not fixed by all of the o;. Without loss of generality, let us say that 
some o;(1) € 1. Let j be such that oj(1) € 1 buto,(1) = | forallr > j. Among 
all expressions of (1) as a product of k transpositions such that at least one does not 
fix 1, we proceed by induction on /. If j = 1, then we note that o2 - - - ox fixes 1, but 
o| does not, so oj --- ox does not fix 1, which is a contradiction. 

Therefore, assume that 7 > 1 and that our result holds for expressions with a 
smaller j value. Without loss of generality, say that 0; = (1 2). We have four cases 
to consider for oj_;. If oj_; = (1 2), then since (1 2)(1 2) is the identity, we can 
cancel it from our expression. But this contradicts the minimality of k. 

Suppose that o;_; fixes 1 but not 2. Without loss of generality, say oj; = (2 3). 
Then notice that (2 3)(1 2) = (1 3 2) = (1 3)(2 3). Thus, replacing oj_,;0; with 
(1 3)(2 3), we see that the 7 value has now decreased to j — 1. By our inductive 
hypothesis, it is impossible to write the identity as a product in this way. 

Suppose that o;_; fixes 2 but not 1. Without loss of generality, say o;_; = (1 3). 
Then we see that (1 3)(1 2) = (1 23) = (1 2)(2 3). Again, replacing o;_\0; with 
(1 2)(2 3), the 7 value decreases, and we have a contradiction. 

Finally, suppose that oj;_; fixes both 1 and 2. Without loss of generality, say 
o;-1 = (3 4). Then by Theorem 6.4, (3 4)(1 2) = (1 2)(3 4), so we can once again 
decrease the j value. Our proof is complete. Oo 


Theorem 6.7. No permutation in S, can be written as a product of both an even 
and an odd number of transpositions. 


Proof. Suppose that 
O102°**OK = TT2°** Tm; 


where each o; and 7; is a transposition, k is even and m is odd. Then 
ae. -1 _ 
(1) =o, 7650, Tltt+ Tm = O° ++ O1T1 +++ Tm, 


since each o; has order 2 (by Theorem 6.2) and is therefore its own inverse. Thus, 
we have written the identity as a product of k + m transpositions. But k + m is odd, 
contradicting the preceding lemma. Oo 


Definition 6.4. We say that a permutation in S, is even (respectively, odd) if it is 
the product of an even (respectively, odd) number of transpositions. 


Example 6.8. In Ss, we note that (1 2 3)(4 5) is odd, as (1 2 3)(4 5) = 
(1 3)(1 2)(4 5). 


Theorem 6.8. A k-cycle is even if and only if k is odd. 
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Proof. If k = 1, then we know that the identity is even. If k > 1, then refer to the 
proof of Theorem 6.6, where we wrote a k-cycle as a product of k — 1 transpositions. 
| 


Thus, to determine if a particular permutation is even or odd, we can look at its 
disjoint cycle decomposition. The preceding theorem tells us whether each cycle is 
a product of an even or odd number of transpositions, so we can easily determine the 
answer for the entire permutation. 


Definition 6.5. The alternating group A, is the set of all even permutations in S,. 


Example 6.9. We note that $3 consists of the identity (which is even), three trans- 
positions (which are odd) and two 3-cycles (which are even). Thus, 


A3 = {(1), Cd 2 3), (1 3 2)}. 


Similarly S4 consists of the identity (even), six transpositions (odd), eight 3-cycles 
(even), six 4-cycles (odd) and three elements that are products of two disjoint trans- 
positions (even). Thus, 


Ag = {(1), Cd 2 3), 1 24), 1 3 2), 1 3 4), C1 4 2), 1 4 3), 2 3 4), 2 43), 
(1 2)3 4), d 3) 4), d 4) 3)}. 


Theorem 6.9. Let n > 2. Then A, is anormal subgroup of Sy, and [Sy : An] = 2. 


Proof. Define a : S, — Zp as follows. Let a(o) = 0 if o is even and | if o is 
odd. We claim that a is a homomorphism. Indeed, as the product of two even or two 
odd permutations is even, and the product of an even and an odd is odd, this follows 
immediately. By definition, the kernel is A,,,so A, is anormal subgroup. Furthermore, 
a((1)) = Oanda((1 2)) = 1, soa is onto. Thus, by the First Isomorphism Theorem, 
S,/An 1s isomorphic to Z2. That is, |S,/An| = 2, so A, has index 2. oO 


Exercises 


6.11. Decide if each of the following permutations is even or odd. 


1. (23) 34) 42 3) 
2. (1435) 2) 3 2 4) 


6.12. Write each of the following permutations as a product of transpositions. 


1. (132) 4)(2 5 3) 
2. (2534)126)35 4) 


6.13. Find every possible order of the product of two transpositions. 


6.14. Letn > 2 and H < S,. Show that either every element of H is even, or 
exactly half of the elements of H are even. 
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6.15. For which n > 2 does A, have a subgroup of order 4? What if we insist that 
the subgroup be cyclic? 


6.16. Find the orders of all the elements of Ag. 
6.17. If n > 2, show that every element of odd order in S,, lies in Ay. 


6.18. Show that every permutation other than the identity in S,, is the product of at 
most n — | transpositions. 


6.19. For which positive integers n does S,, have 


1. more elements of even order than odd order; 
2. more elements of odd order than even order; 
3. the same number of elements of odd order as even order? 


6.20. For which integers n > 2 does there exist ao € A, such that |o| > n? 


6.3 The Simplicity of the Alternating Group 


Why are we so interested in the group A,,? In order to explain this, we must start 
with a definition. 


Definition 6.6. A group is simple if it is nontrivial and has no nontrivial proper 
normal subgroups. 


If G is abelian, then every subgroup is normal, so we are looking for groups whose 
only subgroups are G and {e}. But these were determined in Exercise 3.52. Indeed, 
we saw that these were precisely the cyclic groups of prime order. By Theorem 4.14, 
we have the following result. 


Theorem 6.10. Let G be an abelian group. Then G is simple if and only if G is 
isomorphic to Zp, for some prime p. 


That was pretty painless! However, the nonabelian case is much much more diffi- 
cult. Much! The classification of all of the finite simple groups was one of the biggest 
mathematical projects of the twentieth century. Over one hundred mathematicians 
contributed to the solution, and the proof consists of many thousands of pages of 
journal articles. For obvious reasons, we will not be discussing this classification 
here. 

We will content ourselves with proving one of the earliest results on the subject; 
namely, if n > 5 then A,, is a nonabelian simple group. (Actually, As is the smallest 
nonabelian simple group.) The n = 5 case was established by Evariste Galois in the 
early nineteenth century. Decades later, M.E. Camille Jordan provided a proof for 
alln > 5. 
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Why are finite simple groups so interesting? Let us look at it this way. Suppose that 
G is a nontrivial finite group. Let N; be a proper normal subgroup of largest order in 
G. (If G is simple, this will be {e}. Otherwise, it will be something larger.) Now, we 
claim that G/N is simple. Indeed, by Theorem 4.8, the normal subgroups of G/N, 
are precisely of the form H/N,, where H is a normal subgroup of G containing N). 
But by definition of Nj, H = N, or G. Thus, G/N, has no nontrivial proper normal 
subgroups, so it is simple. 

Now, suppose that N,; ~ {e}. Then in the same way, take a proper normal sub- 
group N» of N; of largest possible order. Then Nj; /N> is simple. We can repeat this 
procedure and obtain 


G=N)>N,>N2.>N3>--- > Ne-1 > Me = {fe}, 


where each Nj;+ is normal in N; and N;/N;+1 is simple. We know the process must 
end, as each N;+; is properly contained in N;, and the original group is finite. In a 
way, then, finite groups can be built up using simple groups. 

Let us begin the process of proving that A, is simple, forn > 5. We start with a 
general fact about the conjugation of cycles. 


Lemma 6.2. Let o = (a, a2 --- ax) be ak-cycle in S,. If tT € Sy, then tot! = 


(t (a) T(az) +++ T(ax)). 


Proof. Suppose that b = t(a;). Then t~'!(b) = a;; hence, o(t~!(b)) = a(a;) = 
i+1 (or a, if i = k). Therefore, tot~!(b) = t(aj+1) (or tT (ay), if i = k). That is, 
tot! permutes the t(a;) as described. If b is not among the t(a;), then t~!(b) is not 
equal to any a;, which means that it is fixed by o. Thus, tot~!(b) = tr~!(b) = D. 
Therefore, tot! is the k-cycle described in the statement of the lemma. Oo 


Corollary 6.2. Letn and k be positive integers with n > k. Then 


I. any two k-cycles are conjugate in S,; and 
2. ifk is odd andn > k + 2, then any two k-cycles are conjugate in Ay. 


Proof. (1) Leto = (a, --- ag) and é = (b; --- by) be any two k-cycles. The 
preceding lemma tells us that in order to show that o and 6 are conjugate, we need 
only find t € S, such that t(a;) = 0; for all i; in this case, tot! = 6. But S, 
contains every possible permutation of {1,...,}. Thus, we can certainly assign 
t(a;) = b;, and for the j ¢ {a,..., ax}, let the t(j) be any distinct values not in 
{b,,..., Dx}. 

(2) As k is odd, the k-cycles are even, and therefore lie in A,. Let o and 6 be any 
k-cycles. Without loss of generality, let us say that 6 = (1 2 --- k). Then just as in 
(1), wecan findt € S, suchthat tor~! = 6. If t € A,, then we are done. Otherwise, 
t is odd, so ((k + 1) (k + 2))t is even. (Note that this is valid, as n > k + 2.) Thus, 
letting 7 = (kK + 1) (K+ 2))t © An, we have 


non”! = ((k+1) (k+2))tor!((k + 1) (k +2) 
= ((k+ 1) k+2)) 2 ++» K+ 1) K+ 2)). 
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But disjoint cycles commute, so this is 
(K+ 1) (K+2))(k + 1) (K+ 2))6 = 54. 


We are done. oO 


Example 6.10. The preceding lemma tells us that (1 2 3) and (1 3 2) are conjugate 
in S3, and the proof suggests how we might demonstrate it. We need to find t such that 
t(1) = 1, t(2) = 3 and r(3) = 2; that is, t = (2 3). Then (1 3 2) = r(1 2 3)t7!. 
However, (1 2 3) and (1 3 2) are not conjugate in A3; this is obvious, as A3 is 
abelian, having order 3, so different elements are not conjugate. It is less obvious 
that they are not conjugate in A, either; however, it is possible to try conjugating 
(1 2 3) by all of the elements of Ay. None of these conjugates will equal (1 3 2). 
However, the preceding lemma tells us that (1 2 3) and (1 3 2) are indeed conjugate 
in As, and the proof tells us that if we take 7 = (4 5)t, then n € As, and we find 
that n(1 2 3)n-! = (1 3 2). 


We can now simplify our task by showing that if we have a 3-cycle in a normal 
subgroup of A,,, then we have all of A,. 


Corollary 6.3. Let n > 3. Then 


I. every element of A,, is a product of 3-cycles; and 
2. ifanormal subgroup N of A, contains any 3-cycle, then N = Ay. 


Proof. (1) We know that an element of A,, is a product of an even number of trans- 
positions. Thus, it is sufficient to show that every product of two transpositions is 
a product of 3-cycles. (As (1) = (1 2 3)(1 3 2), we need not worry about the 
identity.) If the two transpositions are equal, then their product is the identity, with 
which we have just dealt. Suppose they have one number in common. Without loss 
of generality, say (1 2)(1 3). Then note that (1 2)(1 3) = (1 3 2), which is a 3-cycle. 
Finally, suppose they have no numbers in common. Without loss of generality, say 
(1 2)(3 4). Then we observe that (1 2)(3 4) = (1 4 3)(1 2 3), which is a product of 
3-cycles. 

(2) In view of (1), it is sufficient to show that N contains all of the 3-cycles. But 
it contains one 3-cycle, so as N is normal, it contains all of its conjugates. If n > 5, 
then Corollary 6.2 tells us that these conjugates are all of the 3-cycles, and we are 
done. If n = 3, there is little to do, as the only 3-cycles are (1 2 3) and (1 3 2), 
and they are squares of each other; thus, if N contains one, it contains the other. The 
n = 4 case requires a little more work, and we leave it as Exercise 6.24. oO 


And now, our main result for this section. 
Theorem 6.11. [fn > 5, then A, is anonabelian simple group. 


Proof. The fact that (1 2 3)(1 2 4) € (1 2 4)(1 2 3) shows that A, is nonabelian, 
so we can focus on the simplicity. Let N be a nontrivial normal subgroup of A,,. We 
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must prove that N = A,. In view of Corollary 6.3, it is sufficient to show that N 
contains a 3-cycle. 

Take any (1) #4 o € N, and consider the disjoint cycle decomposition of o. 
Suppose, first of all, that there are two or more transpositions in this decomposition. 
Without loss of generality, say o = (1 2)(3 4)6, where 6 is a product of disjoint 
cycles which also fix everything in {1, 2, 3,4} (and 6 = (1) is possible). Let t = 
(1 2 4) € A,. Then as N is normal in A,, we have tot~! € N. That is, 


(12. 4)(1 2)(3 4)5(1 4.2) EN. 


(It is easy to check that (1 2 4)~! = (1 4 2).) As the cycles in 6 are disjoint from all 
the other cycles in the product, we see from Theorem 6.4 that NV contains 


(1 24)1 2)(3 4) 4 2)6 = C1 3) 4)6. 
But N also contains o~!, and therefore 
o'(13)(2 4)6 = 8° '(3 4)(1 2)(1 3)(2 45 EN. 
Again, 6 commutes with these other cycles, so we have 
6 '6(3 4) 2)(1 3)(24) = 1423) EN. 
Let 7 = (1 45) € A, (since n > 5). Then N must contain 
n(1 4)(2 3)n7' = (145)(1 4)(2 3). 5 4) = (2 3)(4 5). 


Thus, N also contains 
(1 4)(2 3)(2 3)(45) = (145). 


But when WN contains a 3-cycle, we know that N = A,. Thus, from this point on, 
we may assume that the disjoint cycle decomposition of o contains at most one 
transposition. 

Now, let us consider the length k of the longest cycle appearing in the disjoint 
cycle decomposition of 0. Ifk = 2, then o is a product of an even number of disjoint 
transpositions, and we have already dealt with this case. 

Suppose that k = 3. Then o is a product of some 3-cycles and, possibly, some 
transpositions. But the product of some 3-cycles and a single transposition is odd, and 
therefore not in A,,. Furthermore, multiple transpositions are not allowed. Therefore, 
we may assume that o is a product of one or more 3-cycles. If it is just one 3-cycle, 
then we are done. So assume that it is a product of two or more disjoint 3-cycles. 
Without loss of generality, say 0 = (1 2 3)(4 5 6)6, where either 5 = (1) or 6 
is a product of disjoint 3-cycles, all of which fix everything in {1, 2, 3, 4, 5, 6}. Let 
t = (345) € A,. Then as N is normal, it contains 
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tat | = (345)(1 2 3)(45 6)5(3 5 4) = (1 2 4)(3 6 5)6, 


1 


since 6 commutes with the other cycles. But N also contains o~", so we have 


o1(124)365)6 = 5 '(465)(1 3 2)(1 2.4)(3 65)5 = (26435) EN, 


again, since 6 is disjoint from the other cycles. Replacing o with (2 6 4 3 5), we can 
move to our final case. 

Let us suppose that k > 4. Then without loss of generality, we may write 0 = 
(123 --- k)d, where k > 4 and 6 is some product of disjoint cycles, all of which fix 
everything in {1,2,...,k}. Let t = (1 2 3) € A,. Then by normality, N contains 


tot! =(123)(123 =». S132) = (145 «+ k23)8. 


But N also contains o~', so noting that (1 23 --- kK)! =(1k (k—1) --- 2), we 
have 


oe 14S a RIBS 5 1b ea Ds DA AS RII SUB HEN, 


again using the fact that 5 commutes with everything else. Thus, NV contains a 3-cycle, 
and the proof is complete. Oo 


We might well ask about A, whenn < 5. Forn = 2, Az is the trivial group; hence, 
by definition, not simple. When n = 3, A3 has order 3 and by Corollary 4.2, it is 
isomorphic to Z3. By Theorem 6.10, it is an abelian simple group. The big exception 
is the n = 4 case, as illustrated in the following example. 


Example 6.11. The alternating group A, is not simple. To see, this let 
N = {(1), 0 2)3 4), 0 3)@2 4), 0 4) 3)}. 


It simply requires some computation to see that N is a nontrivial proper normal 
subgroup of Aq. 


With the exception of Sz, which is abelian of order 2, and hence isomorphic to 
Zy, the symmetric groups are not simple. Indeed, A, is a nontrivial proper normal 
subgroup of S,, whenever n > 3. However, we can state the following result. 


Corollary 6.4. [fn > 5, then the only nontrivial proper normal subgroup of Sy is 
An. 


Proof. Let N be a normal subgroup of S,,. Then NM A, is a normal subgroup of 
A,. As A, is simple, this means that NM A, = A, or {(1)}. If NO A, = Ay, then 
A, < N. But by Lagrange’s theorem, this implies that |A,,| divides |N| and || 
divides |S,|. As |S,| = 2|A,| (because A, is of index 2), this can only mean that 
|N| = |A,| or |S,|. Thus, N = A, or S,, as desired. 
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On the other hand, suppose that NM A, = {(1)}. Then by Theorem 4.4, |NA,,| = 
|N||An|. As |An| = [Sp|/2 and |NA,| < |S,|, we see that |N| = | or 2. If |N| = 1, 
we are done, so suppose that |V| = 2. But by Exercise 4.3, a normal subgroup of 
order 2 in a group is central. However, Exercise 6.8 tells us that the centre of S, is 
trivial. Thus, we have a contradiction, and the proof is complete. oO 


Exercises 
6.21. Show that A; has no subgroup of order 30. 
6.22. In S7, describe the conjugates of (1 2)(3 4 5). 


6.23. Can a nonabelian simple group have a nonabelian simple proper subgroup? 
Either prove that it cannot, or construct an explicit example. 


6.24. Let N be anormal subgroup of A4 containing a 3-cycle. Show that N = Ag. 


6.25. Show that the only nontrivial proper normal subgroup of A, is the one exhib- 
ited in Example 6.11. 


6.26. Let n > 2. Show that every element of S, can be written as a product of 
transpositions of the form (1 7), for various 7. 


6.27. If n > 2, show that every element of S, can be written as a product of the 
transpositions (1 2), (2 3),...,((”— 1) n). 


6.28. Ifn > 2, leto = (1 2) andt = (1 23 --- n). Show that every element of 
S, can be written in the form o"'t/'¢?1?---o!*t/*, where the exponents are any 
integers andk EN. 


Chapter 7 ®) 
The Sylow Theorems rie 


In this chapter, we will prove the Sylow theorems. These are difficult results, but 
fundamental to our understanding of the structure of finite groups. In particular, we 
will show that if p” is the largest power of a prime p dividing the order of a finite 
group G, then G has at least one subgroup of order p”. Furthermore, we will discover 
that any two such subgroups are conjugate to each other, and determine a restriction 
upon the number of such subgroups. We will then explore various applications of 
these theorems, and conclude the chapter by classifying all groups of order smaller 
than 16. 


7.1 Normalizers and Centralizers 


We are very familiar with the centre of a group, which consists of all elements that 
commute with everything. Let us generalize. 


Definition 7.1. Let G be a group, a € G and H a subgroup of G. Then the 
centralizer of a is the set of all elements of G that commute with a. We write 
C(a) = {g € G: ag = ga}. Also, the centralizer of H is C(H) = {g €G: gh= 
hg for allh € H}. 


Example 7.1. Ifa € Z(G), then C(a) = G. If H < Z(G), then C(H) = G. 


In particular, C(e) = G, so we cannot assume that centralizers are necessarily 
abelian. 


Example 7.2. Let G = Ds. Then we find that C(Ro79) = (Roo), C(Rigo) = G and 
C(F\) = {Ro, Riso, Fi, Fo}. 
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Theorem 7.1. Let G be a group, a € G and H a subgroup of G. Then 


C(A) = nen CCA); 

C(a) and C(Af) are both subgroups of G; 

if H is anormal subgroup of G, then so is C(H); 
Z(G) is a subgroup of both C(a) and C(A1); and 
C(a) = C({a)). 


Proof. (1) This follows from the definition. 

(2) Clearly ae = a = ea, soe € C(a). Suppose that b,c € C(a). Then bca = 
bac = abc, so bc € C(a). Also, ba = ab, so b~!(ba)b=! = b~'(ab)b™!. Thus, 
ab~' = b~'a, sob! € C(a). Hence, C(a) < G. Furthermore, combining this fact 
with (1) and Exercise 3.37, we see that C(H) < G. 

(3) See Exercise 4.4. 

(4) Central elements commute with everything hence, in particular, they commute 
with a and elements of H. 

(5) If b € C((a)), then since a € (a), we see that b commutes with a. Thus, 
b € C(a). Conversely, if b € C(a), then ab = ba. Therefore, a € C(b). As 
C(b) < G by (2), we see that a’ € C(b) for all integers i. That is, a'b = ba’, for all 
i € Z. In other words, b € C((a)). 


MRWNS 


Suppose we have a subgroup H of G that is not normal. Of course, H is a normal 
subgroup of H. Furthermore, it is easy to see that H is normal in H Z(G). How 
big a subgroup of G can we find in which H is a normal subgroup? This is where 
normalizers come in. 


Definition 7.2. Let G be a group and H a subgroup. Then the normalizer of H is 
the set N(H) = {a € G: a"'Ha = H}. If K is another subgroup of G, then we 
write Nx (H) = N(A) 1 K, and call it the normalizer of H in K. 


Remember, if a € C(H), then a~'ha = h, for all h € H. But ifa € N(A), 
then a~'!Ha = H. In particular, a~'ha = hy, for some (possibly different) h; € H. 
Thus, the normalizer and centralizer are different concepts. 


Example 7.3. If H is anormal subgroup of G, then N(H) = G. See Theorem 4.3. 


Example 7.4. Let G = S4 and H = ((1 2 3 4)). Then notice that (24) ¢ C(A), as 
(24,1234) = (1 4)? 3), but (1 23 4)(2 4) = (1 2)(3 4). However, 


(24)71(1 23 4)(2 4) = (24) 23 4)(24) = (143 2) = (1234) EH. 
Thus, 
(24)7'(1 23 4)(2 4) = (2.4) '(1 23: 4)(2 4))' = (1 43 2)' EF, 


for alli € Z. Therefore, (2 4)~'H(2 4) < H. By Theorem 4.2, |(2 4)-'H(2 4)| = 
|H|, so we conclude that (2 4)~-!H(2 4) = H. Thus, (2 4) € N(A#). 
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Theorem 7.2. Let H be a subgroup of G. Then N (A) is a subgroup of G containing 
H. Furthermore, if K is a subgroup of G containing H, then H is normal in K if 
and only if K is a subgroup of N(#1). 


Proof. Take any h € H. Then for any c € H, wehaveh-!ch € H,soh"'Hh < H. 
Also, h7!(heh7!)h = c, and hch—! € H. Thus, every element of H is in h~'Hh, so 
H =h~'Hh,and H C N(#).Inparticular, e € N(H). Now, take anya, b € N(#). 
Then (ab)~'!'Hab = b~'(a~'Ha)b = b“'Hb = H; thus, ab € N(A). Also, as 
a~'Ha = H, we have aa~!Haa~! = aHa™!; that is, H = (a~')~'Ha™', hence 
a! € N(H), and N(H) < G. 

Let H < K < G. Then in view of Theorem 4.3, H is a normal subgroup of K 
if and only if kK~' Hk = H for all k € K. By definition of N(#), this occurs if and 
only if K < N(A). 


It is clear that if H is a subgroup of G, then C(H) < N(#). They could, of 
course be equal; indeed, if H < Z(G), then C(H) = N(A) = G. But as we saw 
in Example 7.4, they need not be. An interesting fact about the relationship between 
these two subgroups is given in the following result. 


Theorem 7.3 (NV /C Theorem). Let G be a group and H a subgroup. Then C(H) 
is a normal subgroup of N(H), and N(H)/C(A) is isomorphic to a subgroup of 
Aut(#). 


Proof. We willshow that C (#7) is anormal subgroup of N (7) by illustrating that it is 
the kernel of ahomomorphism from N (#7) to Aut(H). Define a : N(H) > Aut(H) 
via a(a)(h) = aha™!, for alla € N(H), h € H. If a is in the normalizer, then so 
is a~', and therefore aHa~' = H. Thus, we see immediately that a(a) is an onto 
function from H to H. Also, if ahja~! = ahza™!, then h; = hz by cancellation, so 
a(a) is one-to-one as well. Furthermore, for any h;, hz € H, we have 


a(a)(hyhz) = ahyhga~! = aha" ‘ahza~! = (a(a)(h1)) (@e(a) (hz). 


Therefore, a(a) € Aut(H). 
We need to show that a is a homomorphism. But if a,b ¢ N(#), then for any 
h € H, we have 


a(ab)(h) = abh(ab)~! = abhb~!a7! = a(a(b)(h))a~! = a(a)(a(b)(h)). 


Thus, a(ab) = a(a)oa(b), as required. Now, the kernel of a is the set of allc € N(H) 
such that a(c) acts as the identity on H; specifically, we must have chc~! = h, for all 
h € H. But this is precisely the definition of C(H). The First Isomorphism Theorem 
now tells us that N(H)/C(#) is isomorphic to wa(N(#)), which is a subgroup of 
Aut(#), as required. 


The following example illustrates a cute application of the N/C Theorem. 
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Example 7.5. Suppose that G is a nonabelian group of order 39. Let us demonstrate 
that G cannot possibly have a normal subgroup H of order 3. Suppose such a normal 
subgroup exists. Then N(H) = G. Also, what can the centralizer of H be? As H 
has prime order, Corollary 3.6 tells us that it is cyclic, hence abelian. In particular, 
H centralizes itself. Thus, H < C(H) < G. By Lagrange’s theorem, we see that 
3 divides |C(H)|, which in turn divides 39. The only possibilities are |C(H)| = 3 
or 39. Suppose that |C(H)| = 39. Then C(H) = G, so A is central. In particular, 
H < Z(G) < G. But again, looking at the orders, we see that Z(G) = H or G. As 
G is not abelian, Z(G) = H, but that cannot be the case either, as then Z(G) has 
prime index which, by Corollary 4.1, is impossible. 

Therefore, |C(H)| = 3, so C(H) = H. By the preceding theorem, N(H)/C(H) 
is isomorphic to a subgroup of Aut(#). That is, G/H is isomorphic to a subgroup 
of Aut(H). But |G/H| = |G|/|H| = 13. As F is cyclic, the structure of Aut(H) 
is given by Theorem 4.22. But even if we did not have that resource, H is a set of 
3 elements, so there are only 3! = 6 ways to permute them (and not all of those 
are automorphisms). Thus, we are trying to fit a group of order 13 inside one that is 
simply too small. Hence, H cannot exist. 


Exercises 


7.1. Which matrices lie in the centralizer of (( i} in GL2(R)? 


7.2. Which permutations lie in the centralizer of (1 2 3) in $5? 
23 
5 


7.3. In GL2(R), let H be the subgroup generated by ( 6 


N(A). 


7.4. Let H, < G, and Hy < G». Show that in G, x G2, C(M, x Hy) = C(M) x 
C(Ap). 


7.5. If G is a group having a subgroup H of order 2, show that C(H) = N(#Z). 


) Show that C(H) = 


7.6. If G is a nonabelian group, show that G has a subgroup H such that Z(G) © 
H C G. (Yes, this is the same as Exercise 4.20. Solve it using the results in this 
section.) 


7.7. In any group, show that C(a) = C(a7!). 

7.8. Ifa € G anda has odd order, show that C(a) = C(a’*). 

7.9. Let G be a group of order 77 having a normal subgroup H of order 11. 
1. If G is not abelian, show that C(H) = H. 

2. Conclude that G must, in fact, be abelian. 


7.10. Let G be a group of order 77. 


1. Show that G has a subgroup H of order 11. 
2. Show that H is unique, and hence normal. 
3. Conclude that G is isomorphic to Z77. 
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7.2 Conjugacy and the Class Equation 


We are already familiar with the notion of conjugacy in groups. To reiterate, we say 
that a and b in G are conjugate if there exists a g € G such that g-!ag = b. Here is 
a simple fact that we have not mentioned. 


Theorem 7.4. /f G is any group, then conjugacy is an equivalence relation on the 
elements of G. 


Proof. Reflexivity: For any a € G, we have e~!ae = a, so a is conjugate to itself. 


Symmetry: Suppose that a is conjugate to b, say g-!'ag = b. Thena = (g7!)~!bg"!, 
and therefore b is conjugate to a. Transitivity: Suppose that a is conjugate to b, and 
b toc, say g 'ag = b and h-'bh = c. Thence = h-!(g7!ag)h = (gh)~!a(gh). 
Thus, a is conjugate to c. 


We know that equivalence classes partition a set; thus we can break a group down 
into disjoint sets of elements, all elements in each set being conjugate to each other. 


Definition 7.3. Let G be a group and a € G. Then the conjugacy class of a is the 
set C, = {g-'ag: g € G}. 


Conjugacy classes are subsets of G, not subgroups. Indeed, the only one that will 
contain the identity is C.. 


Example 7.6. Note that C, contains only a if and only if a € Z(G). (This happens 
if and only if g-'ag = a for all g € G.) 


Example 7.7. Let k and n be positive integers with n > k. If G = S, ando = 
(123 --- k), then we see from Lemma 6.2 that C, is the set of all k-cycles in G. 


It is important to know the size of a conjugacy class. 


Lemma 7.1. Let G be a finite group anda € G. Then the number of elements in Cg 
is the index of the centralizer, [G : C(a)]. 


Proof. Take g,h € G. Then notice that g~'ag = h7'ah if and only if gh-'a = 
agh—'. That is, g~'ag and h~'ah produce the same conjugate of a if and only if 
gh"! € C(a) or, equivalently, if and only if the right cosets C(a)g and C(a)h are 
equal. In other words, we get a distinct conjugate of a for each right coset of C(a), 
so the number of distinct conjugates is the index, [G : C(a)], as required. 


This allows us to establish an important equation, called the class equation. 


Theorem 7.5 (Class Equation). Let G be a finite group, and let a,,..., ax be 
representatives of the conjugacy classes in G with more than one element. Then 


IG| = |Z(G)| + [G:C] +---+[G: C(a)]. 
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Proof. As G is partitioned into conjugacy classes, we know that |G| is the sum of 
the sizes of these classes. We noted in Example 7.6 that the conjugacy classes of size 
1 are precisely those of the central elements. Collecting them together, we obtain 
|Z(G)| elements. For the remaining classes, we now apply the preceding lemma. 


This has powerful consequences! 


Corollary 7.1. Let G be a group of order p", for some prime p and positive integer 
n. Then the centre of G is not trivial. 


Proof. In the class equation, each [G : C(aq;)] is the size of a conjugacy class with 
more than one element. But it is also |G|/|C(a;)|, and therefore a divisor of p”. Thus, 
each [G : C(a;)] is a multiple of p. But |G| is also a multiple of p, and therefore 
the one remaining term in the equation, |Z(G)|, is a multiple of p. In particular, it is 
not I. 


Corollary 7.2. Let G be a group of order p’, for some prime p. Then G is isomorphic 
to either Z, or Ly X Zy. 


Proof. By Lagrange’s theorem, |Z(G)| € {1, p, p?}. But by the preceding corollary, 
it cannot be 1. Suppose it is p. Then [G : Z(G)] = p*/p = p. By Corollary 4.1, 
this is impossible. Therefore, Z(G) = G, and G is abelian. By Corollary 5.1, we are 
done. 


Theorem 5.6 tells us that Z,2 and Z, x Zp are not isomorphic, so we now have a 
complete picture for groups of order p. 
We also need to know about conjugacy of subgroups. 


Definition 7.4. Let G be a group and H a subgroup. We say that subgroups K and 
L of G are H-conjugate if there exists anh € H such that h>'Kh = L. When 
H =G, we simply say that K and L are conjugate. 


Example 7.8. Let G be Ss; and H = ((1 3)(2 5 4)). Takeo = (1 3)Q 45) = 
((1 3)(2 5 4))~! € H. Then we notice that 


o'(123 4)o = (1 3)(25 4)(1 23. 4) 3)(245) = (1235). 


Therefore, for any integer i, 0~'(1 23 4)'o = (0 '(1 23 4)o)! = (1 23 5)’. This 
means that o~!((1 2 3 4))o = ((1 2 3 5)). Thus, ((1 2 3 4)) and ((1 2 3 5)) are 
H-conjugate (and, therefore, conjugate). 


Theorem 7.6. Let G be a group and H a subgroup of G. Then H-conjugacy is an 
equivalence relation on the set of all subgroups of G. 


Proof. Reflexivity: Let K < G. Then e € H ande!Ke = K. Therefore, K is 
H-conjugate to itself. Symmetry: Suppose that h~'Kh = L, with h € H. Then 


K =hh'Khh"! =hLh"! = (h!)'La. 
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Since h~! € H, we see that H-conjugacy is symmetric. Transitivity: Suppose that 
hy'Kh, = Land hy'Lhy = M, where hy, hy € H. Then 


M = hy! (hy Khy)ho = (hyhz) | K (hyhz). 


Since hh» € H, we are done. 


Thus, the subgroups of G are partitioned into equivalence classes, in which those 
in each class are H-conjugate to each other. In a similar fashion to Lemma 7.1, we 
have a formula for the number of H-conjugates of a subgroup. 


Theorem 7.7. Let G be a finite group and H a subgroup. Then for any subgroup K 
of G, the number of H-conjugates of K in G is[H : Ny(K)]. 


Proof. Take hy,h2 € H. Then hy! Kh; = hy'Khy if and only if hah; | Khyhy! = 
K;; that is, if and only if (hyhy')~'K (hi hz') = K. But this means precisely that 
hyhy' € Ny(K) or, in other words, that Ny(K)h, = Ny(K)h2. Thus, we get 
one distinct conjugate for each right coset of N;,(K) in H, so the number of such 
conjugates is the index, [H : Ny(K)]. 


Exercises 


7.11. What are the conjugacy classes of Dg? 
7.12. What are the conjugacy classes of S4? 
7.13. Let G be a group having subgroups H and K. Suppose that two subgroups 


of G are both H-conjugate and K-conjugate. Does it follow that they are (HM K)- 
conjugate? Either prove that it does or construct a counterexample. 


7.14. Let G be a finite group with normal subgroup N. Show that there are at least 
as many conjugacy classes in G asin G/N. 


7.15. Let G be a group of order p”, where p is a prime and n > 2. Suppose that 
|Z(G)| = p. Show that there exists an a € G such that |C(a)| = p""!. 


7.16. If G is a group of order p”, for some prime p and positive integer n, show 
that G has a subgroup of order p” for each positive integer m <n. 


7.17. Foreach of the following lists, determine ifit is the list of sizes of the conjugacy 
classes of some finite group. If it is, provide such a group. If not, explain why not. 


1. 1,1,1,1,1,5,5,5,5 

2. 1,2,3 

3. 2,4,6 

7.18. Let G be a group and H the set of elements of G having only finitely many 
conjugates. Show that H is a subgroup of G. 


7.19. Suppose that G isa finite group and there existse 4 a € Gsuchthata™! € C,. 
Show that G has even order. 


7.20. Let G be a group of ordern > 1. Show that no conjugacy class can have order 
greater than n/2. 
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7.3 The Three Sylow Theorems 


We can now present the three major theorems due to P. Ludwig Sylow concerning 
subgroups of prime power order in a finite group. We will give the statements and 
proofs in this section, and some applications in the following sections. 


Definition 7.5. Let G be a finite group, and suppose that |G| = p”r, where p is 
a prime, n > 0 and r is a positive integer such that (p,r) = 1. Then a Sylow 
p-subgroup of G is any subgroup of order p”. 


By Lagrange’s theorem, if H is a subgroup of G of order p*, for some k, then the 
order of H cannot possibly be any larger than that of a Sylow p-subgroup. 


Example 7.9. If the p-elements of G form a subgroup H, then that is the unique 
Sylow p-subgroup. By Lemma 5.3, this happens whenever G is a finite abelian group. 
But it can also occur for certain nonabelian groups. As an obvious example, consider 
Dg. The entire group is the Sylow 2-subgroup. 


Example 7.10. As |A4| = 12, a Sylow 2-subgroup has to have order 4 and a Sylow 
3-subgroup has to have order 3. In fact, there is just one Sylow 2-subgroup, namely 
{1),d 2)3 4,d 3)@2 4), 4)(@ 3)} (discussed in Example 6.11). However, 
there are four different Sylow 3-subgroups, namely ((1 2 3)), ((1 2 4)), (1 3 4)) 
and ((2 3 4)). 


The First Sylow Theorem says that we can always find a Sylow p-subgroup. 


Theorem 7.8 (First Sylow Theorem). Let G be a finite group. Then for every prime 
D, G has at least one Sylow p-subgroup. 


Proof. We will proceed by strong induction on |G]. If |G| = 1, then {e} is the Sylow 
p-subgroup for any prime p. Therefore, let |G| > 1 and suppose that the theorem 
holds for smaller groups. Fix a prime p, and let |G| = p"r, where n > 0 and 
(p,r) = 1. Ifn = 0, then again, the Sylow p-subgroup is {e}, so assume that n > 1. 

Suppose there exists a noncentral element a € G such that p does not divide 
[G : C(a)]. Then as |G| = |C(a)|[G : C(a)], we see that p” divides |C(a)| (and 
certainly no higher power of p can do so, as C(a) < G). But a is not central, so 
C(a) # G. Therefore, by our inductive hypothesis, C(a) has a subgroup of order 
p”. But this is also a subgroup of G, completing this case. 

Therefore, assume that for every noncentral a € G, we have p|[G : C(a)]. Also, 
p||G|. Therefore, p divides every term in the class equation except for |Z(G)|, which 
means that p must divide |Z(G)| as well. By Cauchy’s theorem for abelian groups, 
Z(G) has an element z of order p. Then (z) is a central, hence normal, subgroup 
of order p in G. Furthermore, |G/(z)| = |G|/|(z)| = p"r/p = p"~'r. By our 
inductive hypothesis, G/(z) has a subgroup of order p”~!. But Theorem 4.8 tells 
us that the subgroups of G/(z) are of the form H/(z), where H is a subgroup of 
G containing (z). However, |H| = |H/(z)||(z)| = p”-'p = p”. Therefore, H is a 
Sylow p-subgroup. 
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The Second Sylow Theorem says that Sylow p-subgroups are always conjugate 
to each other. 


Theorem 7.9 (Second Sylow Theorem). Fix a prime p. Let G be a finite group and 
P a Sylow p-subgroup of G. If H is a subgroup of G of order p*, for some k > 0, 
then H is conjugate to a subgroup of P. In particular, all Sylow p-subgroups of G 
are conjugate. 


Proof. By Theorem 7.7, there are [G : N(P)] different conjugates of P in G. Also, 
by Theorem 7.2, P < N(P). Therefore, 


IG\/|P| = (GI/N(PIDIN(PII/|PD = [G : N(P)IIN(P) : PI]. 


Hence, [G : N(P)] divides |G|/|P|. But by definition of a Sylow p-subgroup, 
|G|/|P| is relatively prime to p; thus, the number of conjugates of P is relatively 
prime to p. 

Among all of these subgroups conjugate to P (and hence to each other), let us 
consider those that are H-conjugate. We know that H-conjugacy is an equivalence 
relation, and the H-conjugacy classes partition the set of all conjugates of P. If all 
of these H-conjugacy classes contained numbers of elements that are divisible by p, 
then the total number of conjugates would be divisible by p, which is impossible. 
Therefore, there is a subgroup K of G, conjugate to P, such that the number of 
H-conjugates of K is not divisible by p. Now conjugate subgroups have the same 
order, so K is also a Sylow p-subgroup of G. 

By Theorem 7.7, the number of H-conjugates of K is [H : Ny(K)]. But |H| = 
p*, and [H : Ny(K)] = |H|/|Nx(K)| is a divisor of |H|. The only way we can 
avoid having [H : Ny(K)] be a multiple of p is if itis 1. Thus, H = Ny(K). That 
is, H = HM N(K), which means that H < N(K). 

Theorem 7.2 tells us that N(K) contains K as a normal subgroup. Thus, by 
Theorem 4.5, HK is also a subgroup of N(K). But by Theorem 4.4, |HK| = 
|H||K|/|H  K|. However, |K| is the largest power of p dividing |G|, and since 
|H K| must divide |G|, we conclude that p does not divide |H|/|H M K|. As H has 
order p. this means that HM K = H; thus, H < K. But K is aconjugate of P! That 
is, H < g-'Pg, for some g € G. Equivalently, (g~!)~'Hg~! < P, as required. 

The fact that Sylow subgroups are conjugate now follows immediately from the 
fact that conjugate subgroups have the same order. 


The Third Sylow Theorem imposes restrictions upon the possible numbers of 
Sylow p-subgroups in a group. 


Theorem 7.10 (Third Sylow Theorem). Let p be a prime and G a group of order 
p"r, where n > 0 and (p,r) = 1. Then the number of Sylow p-subgroups of G is 
congruent to 1 modulo p and divides r. 


Proof. Fix a Sylow p-subgroup P. By the Second Sylow Theorem, every Sylow 
p-subgroup of G is conjugate to P. Also, as conjugate subgroups have the same 
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order, only Sylow p-subgroups can be conjugate to P. Therefore, the set of Sylow 
p-subgroups of G is precisely the set of conjugates of P. By Theorem 7.7, there are 
[G : N(P)] such conjugates. But P < N(P), which means that 


[G: P]=|G|/|P| = (IGI/IN(P))UN(P)I/| PI). 


By definition of the Sylow p-subgroup, [G : P] = r, which means that [G : N(P)] 
divides r, giving us the last part of the theorem. 

Now, we know that P-conjugacy is an equivalence relation on the set of all Sylow 
p-subgroups. Thus, the number of Sylow p-subgroups is the sum of the sizes of 
the P-conjugacy classes. But if H is a Sylow p-subgroup, then by Theorem 7.7, 
it has precisely [P : Np(H)] P-conjugates. As P has order p*, we see that [P : 
Np(A)] = |P|/|Np(A)| is also a power of p. Thus, it is in particular a multiple 
of p, unless it is 1. So, to determine the number of Sylow p-subgroups modulo p, 
we have only to consider those H such that P = Np(/Z). But this happens if and 
only if P = N(H)/ P; that is, if and only if P < N(#). Now proceed as in the 
proof of the Second Sylow Theorem; we see that this happens if and only if P < H. 
However, P and H have the same order, so this means that P = H. 

That is, modulo p, the number of Sylow p-subgroups is [P : Np(P)]. But P is 
normal in itself, so this is [P : P] = 1. The proof is complete. 


Exercises 


7.21. Find all Sylow 2-, 5- and 7-subgroups of Zj099 x Zy4. 
7.22. Find all Sylow 2- and 3-subgroups of A4. 
7.23. Let G bea group of order 294. Show that G has exactly one Sylow 7-subgroup. 


7.24. Let G bea finite group. Explain why it is impossible for G to have one Sylow 
2-subgroup isomorphic to Z,4 and another Sylow 2-subgroup isomorphic to Zz x Zo. 


7.25. Suppose that p is a prime and p” divides |G|, for some n € N. Show that G 
has a subgroup of order p”. 


7.26. Find a Sylow 2-subgroup of Sy. To what familiar group is it isomorphic? 


7.27. Let G be a finite group having a normal subgroup N. If H is a Sylow 
p-subgroup of G, show that HN/N is a Sylow p-subgroup of G/N. 


7.28. If G is a finite group with normal subgroup N, and H is a Sylow p-subgroup 
of G, show that H M N is a Sylow p-subgroup of NV. 
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7.4 Applying the Sylow Theorems 


Let us discuss some interesting consequences of the Sylow theorems. For one thing, 
we can now complete Cauchy’s theorem, which we previously discussed for abelian 
groups. 


Theorem 7.11 (Cauchy’s Theorem). Let G be a finite group, and suppose that a 
prime p divides its order. Then G has an element of order p. 


Proof. By the First Sylow Theorem, G has a Sylow p-subgroup P, the order of which 
is p", for some positive integer n. Take any e £ a € P. By Lagrange’s theorem, 
|a| = p*, 1 < k <n. Then by Corollary 3.2, ja” '| = p. 


AS a consequence, we can now extend Corollary 5.2 to nonabelian groups. 


Corollary 7.3. Let p be a prime. Then a finite group G is a p-group if and only if 
|G| = p”, for some n => 0. 


Proof. If |G| = p”, then by Lagrange’s theorem, every element has order dividing 
p", So G is a p-group. On the other hand, if some prime q different from p divides 
|G|, then by Cauchy’s theorem, G has an element of order q, so itis not a p-group. 0 


The Third Sylow Theorem tells us about the possible numbers of Sylow 
p-subgroups. But from the Second Sylow Theorem, we can deduce when there 
is just one such subgroup. 


Corollary 7.4. Let p be a prime and G a finite group. Then G has just one Sylow 
p-subgroup if and only if the Sylow p-subgroup is normal in G. 


Proof. Let P be a Sylow p-subgroup of G. Then P is normal if and only if 
a~'Pa = P for alla € G; in other words, if and only if P has only itself as a 
conjugate. But Theorem 7.9 tells us that every Sylow p-subgroup is conjugate to P, 
and since conjugates have the same order, this means that nothing that is not a Sylow 
p-subgroup can be conjugate to P. That is, P is normal if and only if it is conjugate 
only to itself, if and only if there is only one Sylow p-subgroup. 


This corollary is highly useful in finding normal subgroups of groups of a partic- 
ular order. In particular, if we are asked to show that groups of some particular order 
cannot be simple, then our first step is often to see if some Sylow p-subgroup must 
be normal. For instance: 


Theorem 7.12. Let G be a group of order pq, where p and q are primes with p < q. 
Then the Sylow q-subgroup of G is normal. In particular, G is not simple. 


Proof. By the Third Sylow Theorem, the number of Sylow g-subgroups is of the 
form 1+ kq, with k € Z, and divides p. As q > p, the only possibility is k = 0 and 
1+kq = 1. Now apply the preceding corollary. 
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Let us try something slightly more complicated. 


Example 7.11. Let us show that there are no simple groups of order 351. As 351 = 
33 - 13, we note that the number of Sylow 3-subgroups is 1 + 3k and divides 13 and 
the number of Sylow 13-subgroups is 1 + 13/ and divides 27, with k,/ € Z. The 
only solutions for / are 0 and 2; that is, the number of Sylow 13-subgroups is either 
1 or 27. If it is 1, then we know that the Sylow 13-subgroup is normal, and we are 
done. So let us assume that it is 27. Now, each Sylow 13-subgroup is of order 13. 
In a group of prime order, everything but the identity has order equal to that of the 
group; thus, each Sylow 13-subgroup has 12 elements of order 13. Furthermore, if P 
and Q are different Sylow 13-subgroups, then since |P M Q| must divide | P| = 13, 
either P = Q (which is impossible) or PM Q = {e}. Thus, each of the 27 Sylow 
13-subgroups contributes 12 elements of order 13, and there is no overlap. We have 
now used up 12 - 27 = 324 elements of the group. This leaves only 351 — 324 = 27 
elements. But that is the size of one Sylow 3-subgroup! Thus, there is only room 
for one such subgroup. In order words, either the Sylow 13-subgroup or the Sylow 
3-subgroup must be normal. 


We do have to be a bit careful in solving problems like the one in the preceding 
example. It would not have worked well if we had considered the Sylow 3-subgroups 
first. To be sure, we would have found that the number of such subgroups is | or 13, 
and if it is 1, we are done. But if it is 13, we would have a problem counting the 
3-elements we have used, because the Sylow 3-subgroups do not have prime order 
and, therefore, do not necessarily intersect trivially. 

Let us consider groups with order the product of three primes. 


Theorem 7.13. Let G be a group of order pqr, where p, q and r are distinct primes. 
Then G is not simple. 


Proof. Without loss of generality, let us say that p < gq < r. Then the number of 
Sylow r-subgroups is of the form 1 + kr, with k € Z, and divides pg. Now, the 
only positive divisors of pg are 1, p, g and pq. Since r > q > p, we cannot have 
1+kr = porq. If there is only one Sylow r-subgroup, then by Corollary 7.4, we are 
done. Thus, let us assume that there are pq of them. Now, these Sylow r-subgroups 
have prime order, so just as in Example 7.11, they intersect trivially, and provide us 
with pq(r — 1) elements of order r. 

Similarly, the number of Sylow q-subgroups is 1 + /g, with / € Z, and divides 
pr. AS q > p, it cannot be p. If it is 1, then once again, we are done. So it is either 
r or pr. In any case, it is at least r. Therefore, by the same argument, we obtain at 
least r(g — 1) elements of order q. 

Finally, the number of Sylow p-subgroups is | + mp, with m e€ Z, and divides 
qr. If itis 1, then we are done, so we may assume that it is at least g. Thus, we obtain 
at least g(p — 1) elements of order p. 

Adding in the identity, we now have at least 


par —l+rq-)+q~-)+1= par+qr—q-rt+1> part+qr—2r+1 
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elements (since g < r). Butas g > p, and p is a prime, we have g > 3, so gr > 2r, 
and we have accounted for more than pgr group elements, which is impossible. 


In the special case where all of the Sylow p-subgroups are normal, we are in an 
even better position. 


Theorem 7.14. Let G be a group of order p{' +++ py‘, where the p; are distinct 
primes and the n; are positive integers. If, for each i, G has a unique Sylow 
pi-subgroup P,, then G = P, x -+- X Px. 


Proof. Let a; bea p;-element of G. Then by the Second Sylow Theorem, there exists 
agi EG such that g7' (a;) 9; < Pi; say 8) |Gi8; =h; € P;. Thena; = gihig, | Ee P; 
since, by Corollary 7.4, P; is normal. In particular, each P, is the set of all p;-elements 
of G. 

By Lemma 5.4, every element of G can be written as a product of p;-elements, 
1 <i <k.Thus,G = P,P, --- Py. By Exercise 5.10, G is the internal direct product 


of the P;. 


Example 7.12. Suppose we wish to classify the groups of order 45. The number of 
Sylow 3-subgroups is 1 + 3k and divides 5, for some k € Z. Thus, it can only be 1. 
The number of Sylow 5-subgroups is 1+ 5/ and divides 9, for some/ € Z. Therefore, 
the Sylow 5-subgroup is unique as well. According to the preceding theorem, a group 
of order 45 must be the direct product of its Sylow subgroups. By Corollary 7.2, a 
group of order 9 is isomorphic to either Z3 x Z3 or Zo, and Corollary 4.2 tells us that 
a group of order 5 is isomorphic to Zs. Hence, every group of order 45 is isomorphic 
to either Z3 x Zs x Zs or Zo x Zs (and these are not isomorphic to each other, by 
Theorem 5.6). 


Exercises 


7.29. Show that there are no simple groups of order 84. 
7.30. Show that there are no simple groups of order 56. 


7.31. Let G bea group of order 4352 = 28-17. Show that either a Sylow 2-subgroup 
or a Sylow 17-subgroup of G must be normal. 


7.32. Let G be a group of order 870 = 2-3-5 .- 29. Show that at least one of the 
Sylow p-subgroups of G must be normal, for some prime p dividing |G|. 


7.33. Let G bea group of order p?q, for some distinct primes p and q. If q { (p?—1) 
and p { (q — 1), show that G is abelian. 


7.34. Show that Theorem 7.13 is still true even if the primes p, qg and r are not 
assumed to be distinct. 


7.35. Let G bea group of order 57. There are only two possible numbers of elements 
of order 3 in G. What are they? 
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7.36. Let G be a group of order 935 = 5-11-17. Show that the Sylow 17-subgroup 
of G is central. 


7.37. Let G be a group of order 595 = 5 -7- 17. Show that G has a subgroup of 
order 119. 


7.38. Let G be a nontrivial finite p-group. If H is a proper subgroup of G, show 
that H is a proper subgroup of N(/#7). 


7.5 Classification of the Groups of Small Order 


We conclude our discussion of groups by classifying the groups of order up to 15. 
Why 15 in particular? Because the classification of the groups of order 16 is a 
confounded nuisance! There are, in fact, 14 different nonisomorphic groups of that 
order. We are aware of the five abelian groups (see Example 5.7), the dihedral group 
Dj 6, and Dg x Zp, but constructing the other seven would be a lot of work. 

Let G be a group of order n. We already know all of the possibilities for most 
of the values n < 16. If n = 1, there is only the trivial group, {e}. When n € 
{2,3,5, 7, 11, 13}, Corollary 4.2 tells us that G is isomorphic to Z,. If n € {4, 9}, 
we rely upon Corollary 7.2, which says that if p is a prime and n = p’, then G is 
isomorphic to Z,2 or Z, x Zp. Also, ifn € {6, 10, 14}, then we use Theorem 4.15; 
when n = 2p:, for some odd prime p, we find that G is isomorphic to Zp, or Dp. 

We are left with groups of order 8, 12 and 15. With the aid of the Sylow theorems, 
the n = 15 case is a piece of cake. 


Theorem 7.15. Every group of order 15 is isomorphic to Zs. 


Proof. By the Third Sylow Theorem, the number of Sylow 3-subgroups is 1 + 3k, 
for some k € Z, and divides 5. Thus, there is only one Sylow 3-subgroup. By The- 
orem 7.12, the same is true for the Sylow 5-subgroup. Therefore, by Theorem 7.14, 
our group is the direct product of these Sylow subgroups. But the Sylow subgroups 
have prime order and, therefore, are cyclic. By Theorem 5.4, the direct product of 
cyclic groups of relatively prime order is also cyclic. Thus, by Theorem 4.14, our 
group is isomorphic to Z)5. 


Unfortunately, when it comes to groups of order 8, the Sylow theorems cannot 
help us. Indeed, for any finite p-group, the unique Sylow p-subgroup is the whole 
group. We can, nevertheless, classify the groups of order 8 up to isomorphism. In 
view of Corollary 5.1, we know that the abelian groups of order 8 are all isomorphic 
to one of {Zg, Z4 x Z2, Zy x Zo x Zo} (and, by Theorem 5.6, these groups are not 
isomorphic to each other). 

Let G be anonabelian group of order 8. By Lagrange’s theorem, every nonidentity 
element of G has order 2, 4 or 8. If there is an element of order 8 then G is cyclic, 
hence abelian, which is not the case. Also, if every nonidentity element has order 2, 
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then G is abelian, by Exercise 3.32. Therefore, we may assume the existence of an 
element a of order 4. Then (a) has index 2 and, by Theorem 4.1, is normal in G. 

Take any element b in G that is not in (a). Then we observe that the elements 
of G are precisely a’ and ba‘, 0 < i < 3. Also, by the normality of (a), we have 
b-'ab = a/, for some j. Now, conjugate elements have the same order, so j = 1 or 
3. If j = 1, then a and b commute. But this means that all elements of the form a! 
and ba' commute as well, so G is abelian, which is not the case. Thus, b-!ab = a’. 

What is the order of b? We know it is 2 or 4. Suppose that it is 2. We can now 
follow the final part of the proof of Theorem 4.15, and we see that G is isomorphic 
to Dg. 

Therefore, let |b] = 4. Now, G/(a) has order 2. Thus, (b(a))” = e(a), sob? € (a). 
Furthermore, |b?| = 2 (by Corollary 3.2), so b? = a*. But now we know everything 


about the group. We know what the elements are. Furthermore, a‘a! = a**+', baka! = 


bakt!, 
a‘ba! = b(b~a‘b)a' = b(b~!ab)ka! = b(a*)ka! = ba***! 


and 
(ba*)(ba') = b(ba***!) = azakt! = ann. 


for any k, 1 € Z. 

What this means is, we can completely fill in the group table so, up to isomorphism, 
there can be at most one group meeting this description. This does not, however, mean 
that such a group necessarily exists. As it happens, it does! 


eles 7.13. The quaternion group is the group Qg = {+1, +i, +j +k}, where 

2— P= =-1,ij =k =—ji, jk =i = —kj andki = j = —ik. The element 
Li is the identity, and it is easy to see that the group is closed and every element has 
an inverse (for instance, i! = —i). Checking associativity involves verifying a lot 
of cases, but it does work. Furthermore, ij # ji, so Qg is not abelian. Also, we 
note that the only element of order 2 is —1, whereas Dg has many elements of order 
2. Thus, we have a new group, and it must be the one we described above. (In the 
notation we used, let a = i and b = /.) 


We now record our classification of the groups of order 8. 


Theorem 7.16. Every group of order 8 is isomorphic to one of the following, namely 
Zs, Z4 x Zo, Zo x Zo x Zo, Dg or Qs. 


Finally, suppose that G has order 12. The number of Sylow 3-subgroups is 1+3m, 
m € Z, and divides 4, so it is 1 or 4. As a group of order 3 is cyclic, let H = (d) bea 
Sylow 3-subgroup. The number of Sylow 2-subgroups is | + 2/,/ € Z, and divides 
3, so itis 1 or 3. Let K be a Sylow 2-subgroup, which we know is isomorphic either 
to Z4 or Z x Z>. Let us break our discussion down into cases. 

CASE I: The Sylow 2- and 3-subgroups are both unique. (Note that this must 
be the case if G is abelian, as every subgroup is normal.) By Theorem 7.14, G is 


130 7 The Sylow Theorems 


the direct product of its Sylow subgroups. Thus, G is isomorphic to Z4 x Z3 or 
Zo x Zo x Z3. By Theorem 5.6, these groups are not isomorphic. 

CASE II: There are three Sylow 2-subgroups and four Sylow 3-subgroups. Now, 
we proceed as in Example 7.11. As the Sylow 3-subgroups have prime order, they 
intersect trivially, and every element other than the identity has order 3. Thus, we 
have 4 - 2 = 8 elements of order 3. But this leaves only four elements unaccounted 
for. Thus, there is only room for one Sylow 2-subgroup. This case cannot occur. 

CASE II: The Sylow 2-subgroup is unique, but there are four Sylow 3-subgroups. 
Let us further break this case down. CASE IIa: K is isomorphic to Z2 x Z2. Notice 
that |HK| = |A||K|/|JH 1 K| = 4-3/1 = 12, using Theorem 4.4 and the fact 
that HM K must have order dividing both |H| and |K|. Thus, HK = G. Also, 
Hand K are abelian, and if d € C(K), then H < C(K), and we see that G 
is abelian which, as we noted above, means we must be in Case I. Thus, there 
exists e # a € K such that d"'ad = b # a. As K is normal, b € K. Now, 
if d~'ad = d7'bd, then a = b, which is impossible, so d—'bd # b. Also, if 
d~'bd = a, then d~*ad* = d7'(d~'ad)d = d~'bd = a, so d” € C(a), and 
therefore d = (d*)* € C(a), which is not the case. As K = {e, a, b, ab}, we must 
have d~'bd = ab. This means that d~!abd = d~'add~'bd = bab = a, as the 
Sylow 2-subgroup is isomorphic to Z, x Zz. Now, we know that the elements of G 
are precisely hk, with h € H andk € K. We also know how products work and can 
construct a group table. For example, 


(d?ab)(d*b) = d(d~*abd’)b = d(d~'(d~'abd)d)b = d(d~'ad)b = db” = d. 


Thus, there is at most one group in this case, up to isomorphism. 

The question that remains is, can such a group be constructed? In fact it can, and 
we have already seen it. If we let G = Ay, a = (1 2)(3 4), b = C1 3)Q@ 4) and 
d = (1 2 3), we find that all of our conditions are met. 

CASE IIIb: K = (c) is cyclic of order 4. As K is normal, we have d~'cd € (c), 
say d~'cd = c'. As conjugates have the same order, i = 1 or 3. If i = 1, then 
d € C(c), so K < C(c). As in CASE Ia, G = HK, and we see that G is abelian, 
which is not permitted. Therefore, d~'cd = c?. But then 


d~*cd* =d™'\(d"'cd)d = d“'c3d = (d“'cd)’ = (c*)’ =e. 


Thus, d~3cd? = d7!cd = c}. But d? = e, so we have a contradiction. Therefore, 
this case cannot occur. 

CASE IV: The Sylow 3-subgroup is unique, but G has three Sylow 2-subgroups. 
Again, let us break this down further. CASE IVa: G has a Sylow 2-subgroup isomor- 
phic to Z, x Zy. Now, if K < C(d), then we see that elements of K commute with 
elements of H and once again, G is abelian, which is not permitted. Therefore, take 
a € K such thata~'da ¢ d. Now, (d) is normal, and given that the only nonidentity 
elements are d and d”, we have a~!da = d?. If b is another nonidentity element of 
K, we must also have b~!db = d or d?. In the latter case, 
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(ba)~'d(ba) = a~!(b"'db)a = a~'d?a = (a'da)? = (d’)* = d. 


Thus, one of the nonidentity elements of K centralizes d. Without loss of generality, 
say bd = db. 

What is the order of bd? If (bd)/ = e, thenas b andd commute, we have b/d/ = e, 
so bi = d~i ¢ HK = {e}, since H and K have relatively prime orders. Thus, j 
must be divisible by 2 and 3, and hence 6. On the other hand, (bd)® = b°d® = e, so 
|bd| = 6. Also, a has order 2 and 


a~'bda = (a~'ba)(a~'da) = bd? = (bd), 


since (bd?)(bd) = b?d? = e. Since a does not commute with bd, a ¢ (bd). 
Hopefully this situation rings a bell! Refer to the proof of Theorem 4.15. It is at this 
point that we can conclude that we have constructed Dj. Note that Dj and A, are 
not isomorphic, since Dj has an element of order 6 and A, does not. 

Finally, we have CASE IVb: K = (c) is cyclic of order 4. As H is normal, we 
have c-'dc = d/, for some j. Since the identity is only conjugate to itself, this 
means that c~'dce = d or d’. If c~'dc = d, then we see immediately that all powers 
of c and d commute. But once again, G = K H, so G is abelian, which is not the 
case. Thus, c~!dc = d*. But since G = K H, the elements of G are precisely c"d’, 
0 <i <3and0 <s < 2. And we now know how to take a product of any two 
group elements. For instance, 


(cd)(cd?) = c?(c7!de)d* = Cd’ d* = cd. 


In particular, we can fill in the group table. This means that there is at most one more 
group of order 12 that is not isomorphic to any of the ones we have constructed so 
far. In fact, such a group exists. 


Example 7.14. Let G = S3 x Z4. This is a group of order 24. Let H be the set of 
all elements (o, t) € G such that the permutation o and the number ¢ are either both 
even or both odd. In Exercise 7.40, we are asked to show that H is a subgroup of G 
of order 12 and that it is not isomorphic to any of the other groups of order 12 that we 
have found. Thus, it must be the group from CASE IVb. In fact, using c = ((1 2), 1) 
and d = ((1 2 3), 0), we find that it has the desired properties. 


We have now completed the classification of groups of order 12. 


Theorem 7.17. Let G be a group of order 12. Then G is isomorphic either to Z4 x Zz, 
Za X Ly X Zs, Ag, D2, or the group H from Example 7.14. 


Exercises 


7.39. To which of the groups listed in Theorem 7.17 is De x Zz isomorphic? 
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7.40. Let H be the subset of S3 x Z4 described in Example 7.14. Show that H 
is a subgroup of order 12 in $3 x Z4, and that H is not isomorphic to Z4 x Zs, 
Zo x Zo x Zs, Ag or Dy. 


7.41. Show that every subgroup of Qs is normal. 


7.42. Let H be a finite abelian group. Show that every subgroup of Qg x H is 
normal if and only if H has no elements of order 4. 


7.43. Let p be a prime. If a,b € Z, anda # 0, define ag», : Zp > Zp via 
Ag p(x) =ax +b. 


1. Show that these a, form a group G under composition and, if p > 2, that this 
group is not abelian. 
2. If p =7, find a nonabelian subgroup H of G such that |H| = 21. 


7.44. Show that every group of order 21 is isomorphic either to Z; or to the group 
H from the second part of the preceding exercise. 


7.45. Generalize Theorem 7.15 as follows. If p and q are primes, with p > q and 
q { (p — 1), show that every group of order pq is isomorphic to Zpg. 


7.46. We know from Theorem 6.11 that As is a nonabelian simple group of order 
60. Show that there are no nonabelian simple groups with order smaller than 60. 
(The methods we have discussed up to this point are sufficient to deal with every 
order except for 24, 36 and 48. Here is a hint if |G| = 36: Suppose that G has 
distinct Sylow 3-subgroups H and K. What is |H M K|? What can you say about 
|N(H 1 K)|? Find a nontrivial proper normal subgroup of G.) 


Part III 
Rings 


Chapter 8 M®) 
Introduction to Rings ra 


We now move on to the second major type of algebraic object that we are considering: 
the ring. At first blush, rings look a bit more complicated than groups. Indeed, a ring 
is an abelian group written additively, and we must still impose a multiplication 
operation along with several new rules. But in another sense, rings are easier to deal 
with, because they are more familiar. Indeed, when we think of a ring, we tend to 
think of the integers (although, as we shall see, the integers are actually a special sort 
of ring). 

In this chapter, we will define a ring and prove some properties of rings and 
subrings. We shall also discuss two well-behaved types of rings; namely, integral 
domains and fields. 


8.1 Rings 


Let us now define a ring. 


Definition 8.1. A ring is a set R together with two binary operations, written as 
addition and multiplication, such that 


R is an abelian group under addition; 

if a,b € R, then ab € R (closure under multiplication); 

if a,b,c € R, then (ab)c = a(bc) (associativity of multiplication); 
ifa,b,c € R, thena(b+c) = ab + ac (distributive law); and 
ifa,b,c € R, then (a+ b)c = ac + bc (distributive law). 


pele ae ook to a 


As usual when we have an additive group, we will use additive notation. In par- 
ticular, we write 0 for the additive identity of a ring, and —a for the additive inverse 
of a. Notice that we do not insist that the multiplication operation be commutative. 
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Definition 8.2. A ring R is said to be a commutative ring if ab = ba for all 
a,beR. 


Also, while there is an identity for the addition operation, there does not have to 
be one for the multiplication operation. 


Definition 8.3. A ring R is said to be a ring with identity if R has an element, 
denoted 1, such that la = al = a for alla € R. In this case, we call | the identity 
of R. 


Note that if we refer to the identity in a ring, we mean the multiplicative identity 
1 (if it exists), not the additive identity 0. 


Example 8.1. As we observed in Section 2.4, the sets Z, Q, R and C are all com- 
mutative rings with identity, under the usual addition and multiplication operations. 
Also, we saw in Section 2.5 that the same can be said for Z,,, for any positive integer 
n> 2. 


Example 8.2. The set of even integers, 2Z, can easily be seen to be a commutative 
ring without an identity. There is no even integer that can be multiplied by 2 to get 2. 


Example 8.3. The set of all polynomials with real coefficients is a commutative ring 
with identity, using the usual polynomial addition and multiplication operations. We 
denote it by R[x]. The same can be said for the polynomials with integer coefficients, 
Z{x]. In each case, the identity is the constant polynomial, 1. 


How about an example of a noncommutative ring? 


Example 8.4. Let n be a positive integer. Then the n x n matrices with real entries 
form a ring under matrix addition and multiplication. The identity matrix is the 
identity of the ring. However, if n > 1, then it is not a commutative ring as, for 


. 11\/10 10\ /11 a 
instance, (; i} (; 4 # (; 4 (( | We denote this ring by M,,(R). In fact, as 


we observe in Appendix B, we can substitute entries from any ring R in place of the 
real numbers, and we obtain a new ring, M,,(R). If R is a ring with identity, then we 
can form the identity matrix, so M,,(R) is also a ring with identity. The conditions 
under which it is a commutative ring are discussed in Exercise 8.10. 


We also have a way of constructing new rings from old, simply extending the idea 
of the direct product of groups. 


Definition 8.4. Let R and S be rings. Then the direct sum of R and S, denoted 
R @ S, is the Cartesian product R x S under the operations 


(r1, 51) + (v2, 82) = (11 #72, 81 + 52) and (71, 81)(72, 52) = (172, 8182), 


for allr; € R, 5; € S. 
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Theorem 8.1. Let R and S be rings. Then R ® S is a ring. Furthermore, if R and 
S are commutative rings, then so is R ® S. Also, if R and S are rings with identity, 
then soisR® S. 


Proof. The proof is very similar to that of Theorem 3.1. The ring properties all hold 
in the direct sum because they hold in R and S. We will prove one of the distributive 
laws, and leave the rest as Exercise 8.6. 

Take r; € R, 5; € S. Then 


(1, 51) (72, 52) + (73, $3)) = 11, 51) (12 +73, 52 + 53) 
= (ri (72 +73), 81 (82 + 83)) 
= (riro + rir3, $152 + $153) 
= (rirz, 5152) + (rirs, 5153) 


= (1, S1)(r2, $2) + (1, 51) (73, 53). 


Example 8.5. InZs@Z¢, we have (3, 5)-+(4, 2) = (7, 7) = (2, 1) and (3, 5)(4, 2) = 
(12, 10) = (2, 4). 


One additional point is important to keep in mind. A ring is a group under addition, 
not under multiplication! While the multiplication operation satisfies the closure and 
associativity properties, a ring does not have to have an identity. And even if it does, 
elements do not have to have inverses. For instance, Z has an identity, but there is 
nothing we can multiply by 2 to obtain 1. 


Exercises 
8.1. Write the addition and multiplication tables for the ring Zs. 
8.2. Write the addition and multiplication tables for the ring Z3 @ Zo. 


8.3. Let R = {0, 3, 6, 9, 12} with addition and multiplication in Zs. Is R a ring? If 
so, is it commutative, and does it have an identity? 


8.4. Let R be the set of all functions from R to R, under addition and multiplication 
of functions. Is R a ring? If so, is it commutative, and does it have an identity? 


8.5. Let R be the set of all functions from R to R. Let the addition operation be the 
usual addition of functions, but let the multiplication operation be composition. That 
is, the product of a and B isa o 6. Is R a ring? If so, is it commutative, and does it 
have an identity? 


8.6. Complete the proof of Theorem 8.1. 
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8.7. Let R be the set of matrices of the form ¢ a for alla, b,c € Z. Is R aring 


under matrix addition and multiplication? If so, is it commutative, and does it have 
an identity? 


8.8. Show that every ring with a prime number of elements is commutative. 
8.9. Must a ring with a prime number of elements be a ring with identity? 


8.10. Let R be a ring and n a positive integer. Under what conditions is M,,(R) 
commutative? 


8.2 Basic Properties of Rings 


Let us mention a few straightforward properties of rings. 


Theorem 8.2. Let R be a ring. Then the additive identity, 0, is unique. If R has a 
multiplicative identity 1, then it too is unique. 


Proof. As R is a group under addition, we see from Theorem 3.2 that 0 is unique. 
Suppose that a and b are both multiplicative identities for R. As a is an identity, 
ab = b. But as b is an identity, ab = a. Thus, a = b. 


Theorem 8.3. Let R be a ring. Ifa,b € R, then 


I. 0a = a0=0; 
2. (—a)b = a(—b) = —(ab); and 
3. (—a)(—b) = ab. 


Proof. (1) As 0 = 0+ 0, we have 0a = (0+ 0)a = 0a + 0a. Adding —0Oa to both 
sides, we get 0 = Oa. The proof that a0 = 0 is similar. 
(2) Notice that ab + (—a)b = (a + (—a))b = 0b = 0, by (1). As adding (—a)b 
to ab gives 0, we have (—a)b = —(ab). The proof that a(—b) = —(ab) is similar. 
(3) By (2), we have (—a)(—b) = —(a(—b)) = —(—(ab)). But remember that R 
is a group under addition, and hence —(—(ab)) = ab, as required. 


Corollary 8.1. [f R is a ring with identity, then (—1)a = —a, for anya € R. 


Proof. By the preceding theorem, (—1)a = —(la) = —a. 


AS aring is a group under addition, we know from Theorem 3.3 that an expression 
such as a; +d)+---+d, is unambiguous, without the need for brackets. Even though 
the ring is not a group under multiplication, we can apply precisely the same proof 
as that of Theorem 3.3 to show that the expression a,d2---a, also does not require 
brackets. 
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Theorem 8.4. Let R be any ring, and aj, d2,...,d, € R. Then regardless of how 
the product a,a2 - ++ ay is bracketed, the result equals (- - - (((a1a2)a3)a4) ++ + An—1) An. 


In order to avoid mistakes, it is also important to recognize which rules cannot be 
applied in general. For instance, in ordinary arithmetic using the real numbers, we 
take for granted that if ab = 0, then a = 0 or b = 0. This is simply not the case in 
an arbitrary ring. 


Example 8.6. In Ze, we have 2 - 3 = 0, but 2 4 0 and 3 4 0. 


Example 8.7. In M2(R), we have 
12\/(/-2 4\_ (00 
36 1-2) \oo/}’ 
12 00 2 4 
(36) (00) #(7" 4): 


In dealing with groups, we have the cancellation law. We are used to something 


similar happening in ordinary arithmetic; that is, if ab = ac anda # 0, thenb =c. 
Again, this does not have to hold in rings. 


but 


Example 8.8. In Z1, we have 3-1 = 3-5, but3 AO and 1 45. 


Finally, in a group G, we note that if there exists ab € G such that ab = b, thena 
is the identity. (Just multiply on the right by b~'.) But even if a ring has an identity, 
the fact that ab = b does not mean that a = 1. Indeed, the previous example points 
us in the right direction. 


Example 8.9. In Z2, we have 5-3 = 3, but 5 £ 1. 


Thus, to check that a ring element a is the identity, we must make sure that 
ab = b = ba for every b € R, not just for one such b. 


Exercises 


8.11. Let a and b be elements of a ring R. Simplify the following expressions as far 
as possible. 


1. (a+b)(a—b) 
2. (a@—by 


8.12. Let R be a ring with identity. Suppose that there exist a, b,c € R such that 
ab = ba = 1 and ac = 0. Show that c = 0. 


8.13. Let R be a ring with identity. Suppose there exist a,b,c € R such that 
ba = ac = 1. Does it follow that b = c? Show that it does, or find an explicit 
counterexample. 
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8.14. Let R bearing andn > 2a positive integer. Show thatif there existsO Aa eR 
such that a” = 0, then there exists 0 4 b € R such that b> =0. 


8.15. Let R be a ring with identity. Suppose that a(a — 1) = 0 for every a € R. 
Does it follow that a € {0, 1} for every a € R? Either prove that it does, or construct 
an explicit counterexample. 


8.16. Let R be aring in which a* = a for everya € R. 


1. Show that a + a = 0 for everya € R. 
2. Show that R is commutative. 


8.3 Subrings 


Just as we have the notion of a subgroup, we can discuss subrings. 


Definition 8.5. Let R be a ring. Then a subset S of R is said to be a subring if S is 
a ring under the same addition and multiplication operations as in R. 


Example 8.10. We see that Z is a subring of Q, and both are subrings of R. 
Example 8.11. The matrix ring M2(Q) is a subring of M2(R). 
Example 8.12. For any ring R, {0} and R are subrings of R. 

How can we test if a subset is a subring? 


Theorem 8.5. Let R be a ring and S a subset of R. Then S is a subring of R if and 
only if 


I. O€ S; 
2. ifa,b € S, thena— be S; and 
3. ifa,b € S, thenabe S. 


Proof. Suppose that S is a subring of R. Then it is an additive subgroup. By Theo- 
rem 3.13, (1) and (2) hold. As a ring is closed under multiplication, (3) holds as well. 
Conversely, suppose that (1)—(3) hold. Then by Theorem 3.13, S is an additive sub- 
group of R. By (3), S is closed under multiplication. The remaining ring properties 
(associativity and the distributive laws) hold in R, hence in any subset of R. Thus, S$ 
is indeed a subring. 


Note that for condition (1), it is actually sufficient to check that S is not the empty 
set. 


Example 8.13. Let us show that 2Z is a subring of Z. Certainly 0 € 2Z. If 2a, 2b € 
2Z, for some a, b € Z, then 2a — 2b = 2(a — b) € 2Z. Also, (2a)(2b) = 2(2ab) € 
2Z. 


8.3. Subrings 141 
a0 
00 

tains the zero matrix. Also, if a, b € R, then 


(50)- (60) =Cora)es 
(00) (ot) = (00) <5 


Thus, S is a subring of M2(R). 


Example 8.14. Let S = 1ae Rt. Then letting a = 0, we see that S con- 


and 


We recall that the centre of every group is a subgroup. A similar thing happens 
for rings. 


Definition 8.6. Let R be a ring. Then the centre of R is the set {z € R: az = 
za for alla € R}; that is, itis the set of elements of R that commute with everything 
in R. 


Theorem 8.6. The centre of any ring is a subring. 


Proof. Let R be aring and Z its centre. Ifa € R, then0a = 0 = a0, s00 € Z. Take 
any y,z € Z.Thenforanya € R, wehavea(y—z) = ay—az = ya—za = (y—z)a, 
since y and z are central. Thus, y — z € Z. Also, ayz = yaz = yza, and hence 
yz € Z. By Theorem 8.5, we are done. 


Example 8.15. Tf R is a commutative ring, then its centre is all of R. 


Example 8.16. The centre of M2(R) is the set of all matrices of the form ( a. for 


all real numbers r. See Exercise 8.26. 
One particular type of subring deserves special mention. 


Definition 8.7. Let R be a ring with identity 1. Then a subring S of R is said to be 
a unital subring if 1 € S. 


Example 8.17. We observe that Z is a unital subring of Q, but 2Z is not a unital 
subring. 


Note that a subring can fail to be a unital subring because it does not have an 
identity (as is the case with 2Z above), but it can also have an identity which is not 
the same as that for R. 


Example 8.18. Let R = Ze and S = {0, 3}. Theorem 8.5 shows us that S$ is a subring 
of R. It does not contain 1, so it is not a unital subring. However, S is still a ring with 
identity, as 3-0 = 0 and 3 - 3 = 3. That is, 3 is the identity of S. 
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Exercises 


8.17. Let R = {a+ bi : a,b € Z}. Show that R is a subring of C. Is it a ring with 
identity? If so, is it unital? 


Oab 

8.18. Let R = 00c]:a,b,c € R¢. Show that R a subring of M3(R). Is ita 
000 

ring with identity? If so, is it a unital subring? 


Oa 
that R is a subring of M>(R). Is it a ring with identity? If so, is it a unital subring? 


8.19. Let R be the set of matrices of the form (° . for all real numbers a. Show 


8.20. Let R be a ring with subrings S and 7. Show that SM T is a subring. Extend 
this to show the intersection of any collection of subrings of R is also a subring. 


8.21. Let R and S be rings. Show that T = {(r,0):r € R} is asubring of R@ S. 


8.22. Find a ring R and an additive subgroup S of R such that S is not a subring of 
R. 


8.23. Let R be aring anda € R. Show that S = {ra : r € R} is a subring of R. 


8.24. Let R be aring anda € R. Let S = {r € R: ra = 0}. Is S necessarily a 
subring of R? Prove that it is, or find an explicit counterexample. 


8.25. Let R bearing anda ¢€ R. Fix asubring S of R,andletT = {r € R: ra € S}. 
Is T necessarily a subring of R? Prove that it is, or find an explicit counterexample. 


8.26. Show that the centre of M(R) is the set of matrices of the form (( ’). for 
allr ER. 


8.4 Integral Domains and Fields 


Let us discuss a couple of special sorts of rings. 


Definition 8.8. Let R be a commutative ring. Then a nonzero element a € R is said 
to be a zero divisor if there exists a nonzero b € R such that ab = 0. 


Example 8.19. In Ze, we note that 4 is a zero divisor, as 4-3 = 0. On the other hand, 
5 is not a zero divisor. 


Example 8.20. The ring of integers has no zero divisors. 
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As we mentioned at the beginning of the chapter, while we tend to think of the 
integers when we work with rings, they are actually rather special, and this is the 
reason why. 


Definition 8.9. An integral domain is a commutative ring R with identity 1 4 0 
having no zero divisors. 


The condition that 1 # 0 may seem a bit curious. In fact, if 1 = 0, then for any 
a € R, we have a = la = Oa = 0. Thus, R = {0}. So we are only ruling out one 
ring with that restriction. 


Example 8.21. The rings Z, Q, R and C are all integral domains. 


Example 8.22. The polynomial ring R[x] is an integral domain. Indeed, we know 
that it is a commutative ring with identity. Also, if f(x) = ag + ayx +--+ + ay,x" 
and g(x) = bh + bx +--+ + dyx", with a;,b; € Randa, 4 0 # by, then the 
unique term of highest degree in f (x) g(x) is a,b,,x*". As R is an integral domain, 
Anbm # 0. Thus, f (x)g(x) is not the zero polynomial. 


Example 8.23. The rings 2Z, Z¢ and M>(R) all fail to be integral domains. The first 
lacks an identity, the second has zero divisors and the third is not commutative. 


As we discussed in Section 8.2, rings in general do not enjoy a cancellation law. 
However, integral domains do. 


Theorem 8.7 (Cancellation Law). Let R be an integral domain. Suppose that 
a,b,c € Randab=<ac. Ifa £0, thenb=c. 


Proof. If ab = ac, then ab— ac = 0, and hence a(b—c) = 0. Since R is an integral 
domain, either a = 0 (which is not true), or b — c = 0, as required. 


We also wish to discuss a stronger restriction on the ring. We need a definition 
first. 


Definition 8.10. Let R be a ring with identity. Then we say that an element a € R 
is a unit if there exists an element b € R such that ab = ba = 1. In this case, we 
call b the inverse of a and write b = a~!. We write U(R) for the set of all units of 
R, and call it the unit group of R. 


Theorem 8.8. Let R be a ring with identity. Then U(R) is a group under multipli- 
cation. 


Proof. Let a,b € U(R). Then abb“'a~! = ala~! = aa~'! = 1, and b“'!a7!ab = 
b-'1b = b-'b = 1. Thus, b-'a~! = (ab)“!", and ab € U(R). Multiplication in 
a ring is associative. Plainly, 1 © U(R), as 1-1 = 1. Also, if a € U(R), then 
aa~! =a~'a = 1. Thatis, a is the inverse of a~!, hence a~! € U(R). We are done. 


Example 8.24. By definition, U(M,(R)) = GL, (R). 
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Example 8.25. The unit group of Z is {+1}. 
Example 8.26. The unit group of Z, is U(n). See Exercise 8.30. 


Example 8.27. Every element other than 0 in R is a unit. The same can be said for 
Qand C. 


This last example leads us to our next definition. 


Definition 8.11. Let F be a commutative ring with identity | #4 0. Then F is said 
to be a field if U(F) consists of every element of F other than 0. 


Example 8.28. As we noted above, Q, R and C are fields. 


Lemma 8.1. Let R be a commutative ring with identity. Then a unit in R cannot be 
a zero divisor. 


Proof. See Exercise 8.12. 
This immediately yields the following result. 
Theorem 8.9. Every field is an integral domain. 


Of course, the integers are an integral domain, but not a field. However, we can 
say something for finite integral domains. As we might expect, ifa € R, andn isa 
positive integer, we write 

a" =aa---a. 


n times 


Theorem 8.10. Let R be a finite integral domain. Then R is a field. 


Proof. By definition, R is a commutative ring with identity 1 ~ 0. It remains only 
to check that each nonzero element is a unit. Take 0 # a € R. Consider the set 
{a' : i > O}. It consists of infinitely many powers of a. But R is finite. Thus, there 
cannot be infinitely many distinct powers. Let us say that a’ = a/ withi > j > 0. 
Then a/a'~/ = a' = a/. More importantly, a/a'~/ = a/ - 1. Now, a is a nonzero 
element of an integral domain, and products of nonzero elements in such a domain 
do not become zero. Thus, a/ 4 0. By the cancellation law, a'/~/ = 1. Ifi — j = 1, 
then a = 1, which is surely a unit. Otherwise, aa'~/~! = 1. Since i — j — lisa 
positive integer, a‘~/—! € R, and we have an inverse for a. 


We can now handle a particular collection of finite rings of interest. 


Theorem 8.11. Letn > 2 be a positive integer. Then the following are equivalent: 


1. Zy is an integral domain; 
2. Zy is a field; and 


3. nis prime. 


8.4 Integral Domains and Fields 145 


Proof. In view of Theorems 8.9 and 8.10, we know that (1) and (2) are equivalent. We 
need only show that they are equivalent to (3). If n is composite, then write n = kl, 
where k and / are positive integers smaller than n. Then k and / are not 0 in Z,,, and 
yet kl = O0in Z,. Thus, Z,, is not an integral domain. On the other hand, suppose that 
n is prime. Surely Z,, is a commutative ring with identity 1 4 0. Suppose we have 
integers i and j such that ij = 0 in Z,. Then n|ij. By Theorem 2.7, n|i or n|j. That 
is,i = Oor j = 0in Z,,. Thus, Z,, is an integral domain. 


Just as we have subrings, it will also be necessary to know about subfields. 


Definition 8.12. Let F be a field. Then a subring K of F is said to be a subfield if 
it is a field using the same addition and multiplication operations. 


Example 8.29. Q is a subfield of R, which in turn is a subfield of C. 
But how do we test if a subset is a subfield? 


Theorem 8.12. Let F be a field. Then a subset S of F is a subfield of F if and only 
if 
I. le S; 


2. ifa,b € S, thena—be€ S; and 
3. ifa,b € S,andb £0, thenab~' «€ S. 


Proof. Suppose that S is a subfield of F. Then S contains an identity f 4 0. We 
must check that f is 1, the identity of F. But as f is the identity for S, we have 
ff = f. Now, f is a unit in F, so multiplying by f~', we get f = 1. Thus, (1) 
is proved. Since S is a subring of F’, (2) follows from Theorem 8.5. As S is a field, 
every element except 0 has an inverse. This inverse is unique, as U(F’) is a group. 
Therefore, if 0 A b € S, then b~! € S. Since S is a subring, we get (3) as well. 
Conversely, suppose that (1)—(3) hold. In view of (1) and (2), we see that 0 = 
1—1 € S. Take anya, b € S.By(2),a—b € S.Ifb = 0, thenab = 0 € S. Otherwise, 
we have b-' = 1b7! € S, and therefore ab = a(b™')~' € S. By Theorem 8.5, S 
is a subring of F. It certainly has an identity 1 4 0, and it is commutative, since F 
is. Thus, it remains only to check that every nonzero element has an inverse in S. 
But we just did that! If 0 4 b € S, then b~' = 1b7! € S. Therefore, S is indeed a 
subfield of F. 


A small word of caution. It is not sufficient to replace (1) with the condition that 
S is not empty; indeed, if we did so, then we would accept {0} as a field, which is 
wrong. It would be sufficient to assume that S contains a nonzero element b, for then 
(3) would give 1 = bb“! € S. 


Example 8.30. Let F = {a+ bV/2: a,b € Q}. We claim that F is a subfield of R. 
Let us check the conditions. Certainly | = 1+ 0/2 € F, so (1) holds. If a;, b; € Q, 
then (a, +b, V2) — (ay + boV2) = (a) — az) + (b) — bn) V2 € F, and we have (2). 
Let us check the final condition. To begin with, we shall show that F is closed under 
multiplication. But (a, +b, /2)(a2 + boV2) = (ajax + 2b, bo) + (a, bo +anb\) V2 € 
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F. Thus, if we can show that every nonzero element of F' has an inverse in F, 
then we will be done, as we can obtain (3). Take O 4 a + bJ/2¢—F.Ifb= 0, 
then 0 4 a € Q, and certainly a“! € Q C F. Assume that b ¢ 0. Notice that 
(a+ bV/2)(a — bV2) = a? — 2b? € Q. Also, a? — 2b? £ 0. Otherwise, we would 
have (ab—!)? = 2, meaning that J/2 is rational, which is not the case. Thus, a? — 2b? 
has an inverse c € Q. But then (a + bV/2)(ac — beV2) = (a? — 2b?)c = 1. Hence, 
a+ bV/2 has an inverse in F,and F is a subfield of R. 


Exercises 


8.27. Let R = {a+ bi: a,b € Q}. Show that R is a subfield of C. 


8.28. For each of the following rings, which elements are units? Which are zero 
divisors? 


1. Zis 
2. Z3 ® Zo 


8.29. Let R and S be rings with identity. Show that U(R ® S) = U(R) x U(S). 
8.30. Let n > 2 be a positive integer. Show that U(Z,) = U(n). 


8.31. Show that every integral domain contains exactly two elements a satisfying 


a> =a. 


8.32. Let R and S be rings. Under precisely what circumstances is R ® S an integral 
domain? 


8.33. Let F be a field with subfields K and L. Show that K O L is a subfield of F. 
Extend this to show that the intersection of any collection of subfields is a subfield. 


8.34. Let p be a prime and F a field with p? elements. Show that F cannot have 
more than one proper subfield. 


8.35. Let R be an integral domain. Suppose that we have a, b € R such that al? = 
b} and a° = b!°, Show that a = b. 


8.36. Let R be a finite commutative ring having no zero divisors. Show that R is 
{O} or an integral domain. 


8.5 The Characteristic of a Ring 


One rather important property of a ring is its characteristic. Letting R be a ring, recall 
that using additive notation, if we have a € R and some positive integer n, then 


na=a+a+:::+a. 
_—_ 


n times 
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Definition 8.13. Let R be a ring. Then the characteristic of R, denoted char R, is 
the smallest positive integer n such that na = 0 for alla € R. If no such n exists, 
then char R = 0. 


Example 8.31. The characteristic of Z, is n, as clearly na = 0 for any a € Zp, 
whereas no smaller value than n will work if we take a = 1. 


Example 8.32. The ring of integers has characteristic zero. 
In fact, for rings with identity, we only need to look at the identity. 


Theorem 8.13. Let R be a ring with identity. Regarding R as an additive group, if 
the order of | isn < 0, then R has characteristic n. If | has infinite order, then R 
has characteristic zero. 


Proof. If 1 has infinite order, then there is no positive integer n such that n1 = 0, 
and therefore char R = 0. Suppose | has order n < oo. Then no number | < m <n 
can be the characteristic, as m1 ¢ 0. But on the other hand, if a € R, then 


na=a+a+-::--+a=la+lat+-:--4+la=(14+1+4+---+)la=0a=0. 
ee! ee! ee 


n times n times n times 


Thus, 7 is the characteristic. 


Corollary 8.2. Let R be a ring with identity. Then every unital subring of R has the 
same characteristic as R. 


Proof. The same identity has the same order. 


The corollary does not apply to subrings that are not unital. For instance, if R = Ze, 
then char R = 6, but taking the subring S = {0, 2, 4}, we see that char S = 3. 

In a commutative ring of prime characteristic, we have the following interesting 
fact. 


Theorem 8.14 (Freshman’s Dream). Let R be a commutative ring of prime char- 
acteristic p. Then for any a,b € R, we have 


(a+b)? =a? +B?. 


Proof. Let us apply the Binomial Theorem. (We are really only familiar with it for 
real numbers, but the proof in any commutative ring is the same.) We have 


(a+b)? =a? + (arto a (2 )ar20 ace (,” abr + bP, 


(?)- p! 
kk} (p—kIkY 


Now, if 1 < k < p, then 
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Notice that the numerator is divisible by p. However, p does not divide any of the 
terms in the denominator and, therefore, it does not divide the denominator. Thus, p 
divides each ( ), with | < k < p. As our ring has characteristic p, multiplying any 
element by p, and hence by any multiple of p, gives 0. We have our result. 


We tend to encounter commutative rings with prime characteristic a lot in the 
context of integral domains. 


Theorem 8.15. The characteristic of an integral domain is either zero or a prime. 


Proof. Let R be an integral domain. There is nothing to do if char R = 0, so let char 
R=n > 0. We cannot have n = 1, for then | has additive order 1, but only 0 has 
that order. The only remaining problem is if n is composite. Suppose that n = kl, 
with 1 < k,/ <n. Then we have 


—S > ee 
kl times k times 7 times 


Since R is an integral domain, kl = 0 or /1 = O. But the additive order of 1 is n, 
and we have a contradiction. 


Exercises 


8.37. Find the characteristic of each of the following rings. 


1. 3Zo1 = {0,3,..., 18} 
2. Rix] 


8.38. Find the characteristic of each of the following rings. 
1. Z4 8 Zio 
2. M2(Z3) 


8.39. Show that a finite integral domain R must have order p” for some prime p 
and positive integer n. 


8.40. Let F bea field of prime characteristic p. Show that for every positive integer 
n, {a € F: a?" =a} isasubfield of F. 


8.41. Let R be a commutative ring with identity, and suppose that a € R satisfies 
a” = 0 for some positive integer n. 


1. Show that 1+ a € U(R). 
2. If char R is prime, show that 1 + a has finite order in U(R). 


8.42. Let F = {0, 1, a,b} be a field with four elements. Write the addition and 
multiplication tables for F’. 


Chapter 9 M®) 
Ideals, Factor Rings and ra 
Homomorphisms 


We saw in Chapter 4 that for some purposes, subgroups are not quite good enough; we 
needed to consider normal subgroups. There is a similar concept in ring theory. In this 
chapter, we introduce the notion of an ideal, which is a subring with one additional 
condition imposed. We can then define factor rings and discuss ring homomorphisms 
and isomorphisms. Along the way, we will mention several important sorts of ideals, 
including principal, maximal and prime ideals. 


9.1 Ideals 


When we discussed normal subgroups of a group, our main concern was to find a 
condition that we could impose in order to make the group operation on a factor group 
well-defined. Our motivation is the same here. Of course, a subring is necessarily 
an additive subgroup of a ring, and as the additive group is abelian, we do not have 
to worry about normality. However, we need an additional condition to make the 
multiplication operation work properly. 


Definition 9.1. Let R be a ring. Then a subring / of R is said to be an ideal if 
ir,ri € I foralli € J andr € R. We call this the absorption property. 


Note that closure under multiplication is not enough. We need to be able to multiply 
an element of the ideal by any element of the ring and stay within the ideal. Combining 
the definition with Theorem 8.5, we immediately obtain the following. 


Theorem 9.1. Let R be a ring and I a subset of R. Then I is an ideal if and only if 


1. Oe T; 

2. i—j €T foralli, j € I; and 

3. ir,ri eT foralliel,re R. 
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Example 9.1. Letn be any integer. Then nZ is an ideal of Z. Indeed, we already know 
that it is a subring. But also, if nk € nZ, then for any integer r, r(nk) = n(rk) € nZ. 


Example 9.2. Let I be the set of all polynomials f(x) € R[x] such that f(0) = 0. 
We claim that / is an ideal in R[x]. Certainly J contains the zero polynomial. Also, if 
f 0) = g(0) = 0, then (f —g)(0) = f(0)—g(0) = 0, hence f (x)— g(x) € I. Also, 
if f(0) = 0 and h(x) € R[x], then A(O) f (0) = h(0)0 = 0. Hence, h(x) f(x) € I. 


Example 9.3. Let I be the set of all polynomials in Z[x] whose constant term is a 
multiple of 5. Then J is an ideal. See Exercise 9.2. 


Example 9.4. For any ring R, {0} and R are ideals of R. 


Example 9.5. In Q, the ring of integers is a subring but not an ideal. Indeed, 3 € Z, 
but 3(1/5) ¢ Z. Thus, Z does not have the absorption property. 


Actually, this last example is not particularly surprising. Fields do not have inter- 
esting ideals, as the following results illustrate. 


Theorem 9.2. Let R be a ring with identity. If an ideal I of R contains a unit, then 
IT=R. 


Proof. Let u € I be a unit. Then by the absorption property, 1 = uu! € I. But 
then for any a € R, we also have a = la € J. Thus, J = R. 


Corollary 9.1. Let F be a field. Then the only ideals of F are {0} and F. 


Proof. Let I be an ideal of F. If J = {0}, there is nothing to do, so assume that 
0 Aa é F. Thena is a unit, and by the preceding theorem, J = F. 


There are a number of ways in which we can obtain new ideals from old ones. 
For instance, let J and J be ideals of a ring R. Then we write 7+ J={i+j:i¢€ 
T,j¢€ J}. 


Example 9.6. In Z, if we let 1 = 4Z and J = 6Z, then J] + J = 2Z. Indeed, if 
meéeI+J,thenm = 4a+6b = 2(2a+ 3b), for some integers a and b. In particular, 
I+ J C 2Z,. On the other hand, for any c € Z, 2c = 4(—c) + 6c € 1+ J, and hence 
2Z, C I+ J. We can say something more general here. See Exercise 9.4. 


Theorem 9.3. [f I and J are ideals of a ring R, then sois 1+ J. 


Proof. As I and J are subgroups of the abelian additive group R, and hence normal, 
we know from Theorem 4.5 that J+ J is an additive subgroup of R. It remains only to 
check the absorption property. Takei € J, 7 €¢ Jandr € R.Thenr(i+j) =rit+rj. 
Now, ri € J andrj € J. Thus, r(i+ j) ¢ 7+ J. Similarly, (G+ j)r =ir+ jr € 
I+J. 


Also, we can define JJ = {i, jj + injo +--+ +injn i ix € 1, jp € J,n © N}. (We 
cannot simply take terms of the form ij withi € J and j € J, as sums of terms of 
that form cannot necessarily be written in the same form.) 
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Theorem 9.4. Let I and J be ideals ina ring R. Then I J is also an ideal. 


Proof. Clearly 0 = 0-0 € IJ. If we have iz, € T, jx € J, then 


Giji Bl eae imjm) — Gn+tjm+i a Sei in Jn) 
=Nj1 + ore + imJm + (—im41) Jm41 + Bs + (—in) jn € IJ. 


Also, for any r € R, 
ri ji free t imjJm) = (rij Sie (Tim) jm els 
since rij,..., im € I and, similarly, 


(ijt aaa imJm)I = (fir) a im (m1) elJ. 


Notice that J and J are both subsets of J + J, as we can take i + 0 and 0+ j for 
any i € I, j € J. But by the absorption property, JJ CIN J. 
One type of ideal is particularly important. 


Definition 9.2. Let R be a commutative ring with identity and a € R. Then the 
principal ideal generated by a, denoted (a), is the set {ra : r € R}. 


Example 9.7. In Z, we have (n) = nZ for any n € Z. 


Example 9.8. The ideal from Example 9.2 is (x). Indeed, if f(x) € R[x], then 
f() = 0 if and only if the constant term is 0; that is, if and only if the polynomial 
is a multiple of x. 


Theorem 9.5. /f R is a commutative ring with identity, and a € R, then (a) is an 
ideal of R; indeed, it is the intersection of all ideals of R containing a. 


Proof. We have 0 = Oa € (a). Ifr,s € R, thenra — sa = (r —s)a € (a). Also, 
if ra € (a) ands € R, then s(ra) = (ra)s = (sr)a € (a). Furthermore, if J is an 
ideal of R containing a, then by the absorption property, ra € J for allr € R. Thus, 
(a) is a subset of every ideal containing a. As (a) is an ideal containing a, our result 
is proved. 


Notice that the preceding proof does not work if R is not a commutative ring with 
identity. Indeed, if R is not commutative, then {ra : r € R} need not be an ideal; if 
R does not have an identity, then it may not contain a. See Exercise 9.5. 

In a similar fashion, if R is a commutative ring with identity, and a),...,d, € R, 
we can construct the ideal generated by a), ...,a,, namely, the set of all elements 
rj}aqy +--+ ryan, With r; € R. 


Exercises 


9.1. List all of the elements in each of the following principal ideals of Z4 @ Ze. 
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1. ((0,5)) 
2. ((2, 3)) 


9.2. Let J be the set of all polynomials in Z[x] whose constant term is a multiple 
of 5. Show that J is an ideal of Z[x]. 


9.3. Show that the intersection of ideals J and J in a ring R is also an ideal. Extend 
this to the intersection of an arbitrary collection of ideals. 


9.4. Let m and n be positive integers. Show that mZ + nZ = (m, n)Z. 


9.5. Given aring R and an element a, let S = {ra : r € R}. Show by example that 


1. Sneed not contain a, even if R is commutative; and 
2. Sneed not be an ideal, even if R has an identity. 


9.6. Let R be a commutative ring. If ] = {a € R: a" = 0 for somen € N}, show 
that J is an ideal of R. 


9.7. Find ideals 7 and J inaring R such thatlJ AIM J. 


9.8. Let R be acommutative ring with identity having exactly two ideals. Show that 
R is a field. 


9.9. Consider the additive group G and subgroup H from Exercise 3.42. Define a 
multiplication operation on G via (a), dz, ...)(b1, b2, ...) = (ab), agb2, .. .). Show 
that G is a ring and H is an ideal. 


9.10. In the preceding exercise, show that H is not a principal ideal. 


9.2 Factor Rings 


Let R be a ring and J an ideal. Then R is an abelian group under addition, and J is 
a subgroup. Thus, we can consider the left cosets a + I, for alla € R. We use these 
to form a factor ring. Remember that a + J = b+ / if and only ifa —beT. 


Definition 9.3. Let R be aring and J an ideal of R. Then the factor ring (or quotient 
ring), R/J, is the set of all left cosets {a + J : a € R} together with the operations 
(a+1)+(b+1)=a+b+TJ and (a+/)(b+/) =ab+T,foralla,be R. 


Theorem 9.6. For any ring R and ideal I, the factor ring R/I is a ring. 


Proof. Since R isan abelian group under addition, J is necessarily anormal subgroup. 
Thus, we know from Theorem 4.6 that R/J is a group under addition. By Theorem 4.7, 
it is abelian. 

Let us show that the multiplication operation is well-defined. Suppose that a,;+J = 
a, + J] andb, +] = b, + I. Then notice that 
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ab — aybz = a,b, — ayb2 + aybz — azbz = ay (bi — bz) + (ay — a2)bo. 


Now, b; — b2 € I. Thus, by absorption, a, (b; — bz) € I. Similarly, a; — a) € T, and 
hence (a, — az)b2 € I. Thus, a,b, — azb2 € I, and therefore a,b; + I = aobo + I. 
That is, the multiplication operation is well-defined. 

We must check the remaining properties of a ring. Take any a,b,c € R. Then 
since ab + I € R/I, we have closure under multiplication. Also, 


(a+ D(b+D(c+D)) = (at Dbe+ 1 
=a(bce) +1 
= (ab)c +1 
= (ab+I)(c+1) 
=(a+NDb6+ND))c+D), 


and associativity holds. Similarly, 


(a+ ID(b+HN+c+))=@4+Db+c4+) 

a(b+c)+I1 

ab+ac+I 
(ab+1)+(ac+T) 
=(a+Ib+1)+(a+Icrt+!). 


The other distributive law is proved in the same fashion. 


Let us discuss a few examples of factor rings. 


Example 9.9. Let R = Zand I = (5) = 5Z. Then R/J = {0+ /,1+1/,2+ 
7,3 + 7,4 + J} and, for instance, (2+ J) + (44+ 7) = 64+/7 = 147 and 
B4+ND44+)=124+7=241. 

Example 9.10. Let R = M2(Z) and let J be the ideal consisting of all matrices 
whose entries are even. Then notice that for any aj; € Z, we have 


a1 7 bio rae 2 
a2) a22 ba baz 


where b;; is 0 if aj; is even and | if a;; is odd. Thus, R/J consists of the sixteen 


different elements @ ! . ) +1, b;; € {0, 1}. We perform arithmetic in the following 
21 


12 
22 
fashion: 


and 
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10 11 11 11 

(13) +7) (C01) +) = Ga) += Go) + 
Example 9.11. Let R = R[x] and J = (x? + 3). Readers familiar with polynomial 
long division will know that if f(x) € R, then f(x) = (x? + 3)q(x) + r(x), where 
q and r are polynomials, with r(x) = a + bx, for some a, b € R. (Those who are 
unfamiliar with polynomial long division can peek ahead to Section 10.1, where it 
will be discussed in more generality.) Since (x? + 3)q(x) € I by absorption, we 
know that elements of R/J are of the form a + bx + I, with a,b € R. Addition 
behaves as expected; for instance, (2+ 3x+J/)+(7—4x+J/) =9-—x+1. To deal 
with multiplication, observe that x? — (—3) € I; thus, x? + J = —3+4+ 1. Therefore, 
we have calculations such as 


G+44 4+ D(-7 +2647) =] 35 — 18 8 7 
= —35 — 18x + 8(-3) + 
= —59— 18x41. 


Let us also record a few basic facts about factor rings. 


Theorem 9.7. Let R be a ring and I an ideal. Then 


1. if R is commutative, then so is R/T; 
2. if R has an identity, then so does R/I; and 
3. ifuis aunit of R, thenu+ I is a unit of R/T. 


Proof. (1) Ifa,b € R, then (a+ J)(b+ 1) =ab+IT=ba+lT=(b+])(a4+I). 
(2) Ifa € R, then (1+ J)(a+J) =a+I/] = (a+/)(1+/). Hence, | + / is the 
identity of R/T. 
(3) Observe that (w+ 7)(u~! +7) = 14-7 = (u7!+7)(u+ 1); thus, (ut+ F7! = 
ul+T, 


Theorem 9.8. Let R be a ring with ideals I and J, such that I © J. Then J/I is 
an ideal of R/T. 


Proof. Wesee that I is a subring of J, and since J enjoys the absorption property in R, 
it enjoys itin J as well. Thus, / is an ideal of J, and so J/I makes sense. Now, 0 € J, 
and thereforeO+J € J/I.Tfj,, jo € J,then(ji+1)—(ft+l)) = Gi-j)t+l € J/T. 
Also, if 7 € J andr € R, then(r+/)(j7 +17) =rj 4+] € J/I, sincerj € J. 
Similarly, (j + D(r + J) = jr +I € J/I. The proof is complete. 


Exercises 
9.11. Let R = Zand / = (5). Write the addition and multiplication tables for R/T. 


9.12. Let R = Zo ® Z4 and J = ((4,2)). Write the addition and multiplication 
tables for R/T. 
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9.13. Let R = R[x] and f(x) = x7 + 6x? +2. If J = (f(x)), calculate the product 
(4e" + 3 1 DOr? ae 2 ie R/T. Reduce the answer to the form 
ax? +bx +c+1, forsomea,b,c ER. 


9.14. Let R be a ring and / an ideal. Show that R/J is commutative if and only if 
ab—baeéTlforalla,be R. 


9.15. Let J and J be ideals in aring R. Show that R/J and R/J are both commutative 
rings if and only if R/(7 M J) is commutative. 


9.16. Let R be a ring and / a proper ideal. 


1. If R is an integral domain, does it follow that R/J is an integral domain? Prove 
that it does, or find a counterexample. 

2. If R/T is an integral domain, does it follow that R is an integral domain? Prove 
that it does, or find a counterexample. 


9.17. If F is a field of order 81, what are the possible orders of F/J, where J is an 
ideal of F'? 


9.18. Let 1) C bb C1, C::- be ideals of R. Let 1 =U, hh. 


1. Show that / is an ideal of R. 
2. Suppose that R/J is commutative. Show that for every a, b € R, there exists an 
n €N such that ab — ba € I,,. 


9.19. Let R and J be as in Exercise 9.6. Define the analogous ideal for R//, namely 
fa+1¢€R/I:(a+I)" =0+7 for some n € N}. Show that this ideal is {0 + 7}. 


9.20. If J is an ideal of a ring R, show that the subrings of R/J are precisely of the 
form S/J, where S is a subring of R containing J. Further show that S/J is an ideal 
of R/T if and only if S is an ideal of R. 


9.3. Ring Homomorphisms 


We recall that a group homomorphism is a function from one group to another that 
respects the group operation. There is a similar concept for rings, but both of the ring 
operations must be respected. 


Definition 9.4. Let R and S be rings. Then a ring homomorphism (or, simply, 
homomorphism) from R to S is a function aw : R > S satisfying 


a(r, +12) = a(r)) + a(72) 


and 
a(rir2) = ary )a(r2) 


for allr),r2 € R. 
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Thus, a ring homomorphism is a homomorphism of additive groups, with the 
additional property that it respects the multiplication operation. The kernel is the 
same as the kernel of the additive group homomorphism. 


Definition 9.5. Let a : R — S be aring homomorphism. Then the kernel of a is 
ker(a) = {r € R: a(r) = O}. 


Example 9.12. Letn > 2 be a positive integer. Then a : Z > Z,, given by a(a) = 
[a] (where we insert the square brackets for clarity) is a homomorphism. Indeed, by 
Example 4.10, it respects the addition operation. Also, for any a,b € Z, a(ab) = 
[ab] = [a][b] = a(a)a(b). Furthermore, by Example 4.10, ker(a~) = nZ. 


Example 9.13. Definea : C > C viaa(a+ bi) = a — bi, for all a, b € R. Then 


notice that 
a((a + bi) + (c+di)) =a((a+c)+ (+ d)i) 


=a+c—(b+d)i 
= (a — bi) + (c — di) 
=a(a+ bi)+a(c+di), 


for all a, b, c,d € R. Similarly, 


a((a + bi)(c + di)) = a((ac — bd) + (ad + bc)i) 
= ac — bd —(ad+be)i 
= (a — bi)(c — di) 
=a(a+t+ bi)a(c+ di). 


Thus, a is a homomorphism. Also, if a(a + bi) = 0, then a — bi = 0, and hence 
a = b=0. Thus, ker(a@) = {0}. 


Example 9.14. Tf R and S are any rings, then a : R — S given by a(r) = 0 for 
all r € R is a homomorphism. Indeed, by Example 4.13, it is an additive group 
homomorphism, and a(rjr2) = 0 = 0-0 = a(r})a(r2), for all 71,72 € R. The 
kernel of a is R. 


Let us record a few basic properties of ring homomorphisms. If a : R > S$ 
is a ring homomorphism, then as with group homomorphisms, we write a(M) = 
{a(m) :m € M} anda7!(N) = {r € R: a(r) € N}, for any subring M of R, and 
any subring N of S. 


Theorem 9.9. Leta: R — S bearing homomorphism. Then 


I. ker(a) is an ideal of R; 

2. a is one-to-one if and only if ker(a) = {0}; 

3. a(0) = 0; and 

4. if R is a ring with identity, then so is a(R), and a(1) is the identity of a(R). 
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Proof. (1) By Theorem 4.11, ker(@) is an additive subgroup of R. It remains to check 
the absorption property. But if k € ker(aw) andr ¢€ R, then a(rk) = a(r)a(k) = 
a(r)O = 0, and hence rk € ker(a@). Similarly, kr € ker(a). 

(2) See Theorem 4.11. 

(3) This follows immediately from Theorem 4.10. 

(4) Ifr € R, thena(la(r) = ar) =a(r) =a(rl) =a(r)a(1). 


We must be a bit careful with the final part of the last theorem. It certainly does 
not follow that a(1) is the identity of S, as the following example illustrates. 


Example 9.15. Define a : R — M>2(R) via 


a(r) = ¢ 0) 


for allr € R. It is easy to verify that w is a homomorphism. Now, a (1) is the identity 
of a(R), but not of M>(R). 


Theorem 9.10. Leta: R— S bearing homomorphism. Let M be a subring of R 
and N a subring of S. Then 


I. a(M) is a subring of S; 

2. if M is an ideal of R, then a(M) is an ideal of a(R); 
3. a !(N) is a subring of R; and 

4. if N is an ideal of S, then a~!(N) is an ideal of R. 


Proof. (1) By Theorem 4.12, w@(M) is an additive subgroup of S. If m;,m2 € M, 
then a(m,)a(m2) = a(mym2) € a(M), since mym2 € M. 

(2) By (1), it remains only to check the absorption property. Ifm € M,r € R, 
then a(r)a(m) = a(rm) € a(M), sincerm € M. Similarly, a(m)a(r) € a(M). 

(3) By Theorem 4.12, a—!(N) is an additive subgroup of R. If ry, r2 € a !(N), 
then a(rjr2) = a(r})a(r2) € N, since a(r1), a(r2) € N. Thus, ryr2 € a! (N). 

(4) In view of (3), we only need to check the absorption property. Takea € a~!(N) 
andr € R. Then a(ra) = a(r)a(a) € N, since a(a) € N, and N is an ideal of S. 
Therefore, ra € a~!(N). Similarly, ar € a~!(N). 


Once again, we note that the second part of the preceding theorem does not say 
that ~a(M) is necessarily an ideal of S. 
One more homomorphism will prove useful later. 


Theorem 9.11. Let R be a ring with identity of characteristic n. Then there is a 
homomorphism a: Z — R with kernel (n). 


Proof. Define a : Z > R via a(k) = kl, for all k € Z. Let us check that @ is a 
homomorphism. If j,k € Z,thena(j+k) = (j+k)1 = j1+k1 = a(j)+a(k). (This 
is Theorem 3.6, using additive notation.) Also, a( jk) = (jk)1, whereas a(j)a(k) = 
(j1)(K1). Again by Theorem 3.6, (jk)1 = j(k1). If 7 > 0, then 


158 9 Ideals, Factor Rings and Homomorphisms 


(DK = Ate +k) Hk +--+ +L = JD. 


j times j times 


If 7 < 0, then (jk)l = (—j)(—A))1 and (JI) (KL) = (—j1)(—k1), and we can 
use the 7 > O argument. If 7 = 0, then (jk)1 = (j1)(k1) = 0. Thus, in any case, 
a(jk) = a(j)a(k), and w is ahomomorphism. 

We note that k € ker(a) if and only if k1 = 0. Ifn = 0, then by Theorem 8.13, 1 
has infinite additive order. Therefore, the only solution is k = 0; thus, ker(a@) = (0). 
If n > O, then by Theorem 8.13, 1 is the additive order of 1. Furthermore, by 
Corollary 3.2, k1 = 0 if and only if the additive order of 1 divides k; that is, if and 
only if n divides k. In other words, the kernel is the set of multiples of n. The proof 
is complete. 


Exercises 


9.21. Decide if each of the following functions is a ring homomorphism. 


1. a:Z—>R,a(a) = 2a. 
2. a: R[x] > R,a(f(x)) = f(2). 


9.22. Decide if each of the following functions is a ring homomorphism. 


1. a: M>(R) > M,(R), a(A) = € ) A ( . 
2. a : M>(R) > R, a(A) = det(A). 


9.23. Leta : R — Sand 6 : S > T be ring homomorphisms. Show that Ba : 
R — T is also a ring homomorphism. 


9.24. Leta: R > Sand f: S > T bering homomorphisms. Show that ker(@) C 
ker(Ba). If 6 is one-to-one, show that ker(6a) = ker(q@). 


9.25. Definea: Z@6Z > Z@Z via a((a, b)) = (a, 0). Is this a ring homomor- 
phism? If so, find ker(@) and a~!(2Z ® 3Z). 


9.26. Define a : Zg > Zyo6 via a([a]) = [a], for all a € {0,1,..., 7}, where the 
square brackets represent the congruence classes. Is this a ring homomorphism? If 
so, find ker(w) and a~!([3]). 


9.27. Let R be aring and / an ideal. Show that there exist a ring S and a homomor- 
phism aw : R — S such that ker(a) = I. 


9.28. Let R be a commutative ring with prime characteristic p. Show that a : R > 
R given by a(a) = a? is a ring homomorphism. 


9.29. Let F bea field of order 16 and K a field of order 4. Find all homomorphisms 
from F to K. 


9.30. Find all homomorphisms from Z to Q. 
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9.4 Isomorphisms and Automorphisms 


As with groups, we use isomorphisms to establish if two rings have the same structure. 


Definition 9.6. Let R and S be rings. Then a ring isomorphism (or, simply, iso- 
morphism) is a bijective homomorphism from R to S$. When such an isomorphism 
exists, we say that R and S are isomorphic rings. 


Example 9.16. Consider the function a : Zs; > Z3@Zs given by a(a) = (a, a) for 
alla. By Example 4.16, this is an isomorphism of additive groups. We claim that it is, 
in fact, aring isomorphism. All that remains is to show that w respects multiplication. 
But if a, b € Zs, then a(ab) = (ab, ab) = (a, a)(b, b) = a(a)a(b). Thus, Zs and 
Z3 ® Zs are isomorphic rings. 


We must not, however, make the mistake of thinking that rings that are isomorphic 
as additive groups are necessarily isomorphic as rings! 


Example 9.17. Let R = Z and S = 5Z. As additive groups, these are isomorphic, 
since 5Z is infinite cyclic, being generated by 5, and we can apply Theorem 4.14. 
However, as rings, there cannot even be an onto homomorphism from R to S. Why 
not? If there were, by Theorem 9.9, | would have to map to the identity of S, which 
is sadly lacking. Thus, R and S are not isomorphic rings. 


Example 9.18. Define a : C — M>(R) via 


a(a + bi) = & ; 


for all a, b € R. Let us check that a is ahomomorphism. If a, b, c,d € R, then 


a((a+ bi)+ (c+di))=a((a+c)+(b+d)i) 


_ fate —(b+d) 
~\b+d ate 


=a(a+bi)+a(c+di). 


Also, 


a((a + bi)(c + di)) = a((ac — bd) + (ad + be)i) = be +be ac—bd 


a(a + bi)a(c +di) = (; es) ¢ ~) 


and these are the same. Clearly, ker(@) = {0}, so a is one-to-one. Now, a is not onto, 
but we see that C is isomorphic to the image a(C), namely 


ac — bd —(ad + >) 


whereas 
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(fe) saber. 


Let us discuss a few properties of isomorphisms. 
Theorem 9.12. On any collection of rings, isomorphism is an equivalence relation. 


Proof. Reflexivity: The function a : R — R given by a(a) = a for all a is an 
isomorphism. Symmetry: Let a : R — S be an isomorphism. By Theorem 4.13, the 
inverse a! : § — R is an isomorphism of additive groups. We only need to check 
that it respects multiplication. Take any s;, 52 € S, and suppose that a~!(s;) = r;. 
Then a(riro) = a(r))a(ro) = 8152; that is, a@~!(s}52) = rire = a!(sy)a7!(s2). 
Transitivity: Leta : R — S and 6 : S — T be isomorphisms. By Theorem 4.13, 
Boa: R— T isan isomorphism of additive groups. Again, we must check that it 
respects multiplication. Take r,, r2 € R. Then 


(Boa)(rir2) = B(a(rir2)) = B@(ria(r2)) = Bla(ri))B(a(r2)). 


The proof is complete. 


Theorem 9.13. Leta: R — S be a ring isomorphism. Then 


1. if R is commutative, then so is S; 

2. if R has an identity, then so does S; 

3. if R is an integral domain, then so is S; and 
4. if Risa field, then so is S. 


Proof. (1) Take s1, s2 € S. Then s; = a(r;), for some r; € R. Thus, 
$182 = ar )a(r2) = a(r1r2) = a(rary) = a(r2)a(r1) = $251. 


(2) Use Theorem 9.9 and the fact that @ is onto. 

(3) By (1) and (2), S is a commutative ring with identity. If 1 = 0 in S, then 
S = {0}. As @ is bijective, R = {0}, which is impossible. It remains only to check 
for zero divisors. Suppose that s; and sz are nonzero elements of S with s;s. = 0. 
Let us say a(r;) = s;. Then 0 = 5152 = a(r)a(r2) = a(r1r2). AS @ is One-to-one, 
r\r2 = 0. But R is an integral domain, so either r; = 0 or rz = 0, which means that 
5; = Oor s2 = 0, giving us a contradiction. 

(4) Once again, S is a commutative ring with identity and | # 0. Suppose that 
0 #5 € S. Then s = a(r), forsomer € R. Now, r 4 0, sor has an inverse in R. 
Since @ is onto, we know that a(1) = 1. Thus, 


1=a(1) =a¢r7!) =a@a(r™) = sar). 


That is, a(r~') = s~!, and every nonzero element of S has an inverse. 
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Example 9.19. The rings 2Z, M2(R), Ze, Z and Q are all nonisomorphic. Indeed, 
2Z does not have an identity, so it cannot be isomorphic to any of the others, which 
do have an identity. Also, all of the rings are commutative except for M>(IR), so it is 
ruled out. Next, Ze is not an integral domain, but the remaining two are. (Also, it is 
finite and the others are infinite.) Finally, Q is a field but Z is not. 


Given a ring R, we might well ask if it is a subring of a field. For example, Z is a 
subring of Q. But not every ring can be a subring of a field. Indeed, a noncommutative 
ring or a ring containing zero divisors cannot exist inside a field. So, it seems that 
integral domains are a good place to start. In fact, if R is an integral domain, then we 
can construct a field F containing an isomorphic copy of R. The method we will use 
may seem somewhat familiar; actually, it is exactly the way in which Q is constructed 
from Z. We will need something comparable to numerators and denominators. Also, 
the denominator must not be zero. Furthermore, we need some way of recognizing 
that 10/25 = 8/20, for instance. This gives us an idea of how to proceed. 

Let R be an integral domain. Then let S be the Cartesian product R x (R\{0}). 
Note that S is only a set, not a ring. (If R = Z, for instance, when we look at 
(10, 25) € S, we are thinking of the fraction 10/25.) Let us define a relation ~ on 
S via (a, b) ~ (c, d) if and only if ad = bc. (Continuing our parenthetical thought, 
(10, 25) ~ (8, 20).) 

We claim that ~ is an equivalence relation. Reflexivity: As R is commutative, we 
see that (a, b) ~ (a, b). Symmetry: Suppose that (a, b) ~ (c,d). Then ad = bc and 
again, by commutativity, (c,d) ~ (a, b). Transitivity: Suppose that (a, b) ~ (c, d) 
and (c,d) ~ (e, f). Then ad = bc, and hence adf = bcf. Also, cf = de, and 
hence bef = bde. Thus, adf = bde. Now, d ¢ Oand R is an integral domain. Thus, 
we have af = be, and hence (a, b) ~ (e, f). For the sake of simplicity, write [a, b] 
for the equivalence class of (a, b). (In our example in Z, we have [10, 25] = [8, 20], 
and this is the set of all pairs (a, b), witha, b € Z, b 4 0 and 10b = 25a.) Let us 
write F for the set of all equivalence classes of S. 

The addition and multiplication operations on F work precisely as we would 
expect with fractions. Specifically, 


[a, b] + [c, d] = [ad + bc, bd] 


and 
[a, b][c, d] = [ac, bd]. 


We must verify that these operations are well-defined. Suppose that [a,, b,] = [a, b] 
and [e1; d,] = [c, d\. Then [a1, by] + [c1, d,| = [a\d, + bic, b,d,]. But 


(ad + bc)(b\d;) = (a\d, + b\c\)bd = dd, (ab, = a,b) + bb, (cd, = cd) 
= dd,(0) + bb, (0) 
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thus, [a, b] + [c, d] = [a1, b1] + [c1, di]. Similarly, [a1, b)]le1, di] = [aici, bids], 
and 
acbh\d, — a,c\bd = (ach,d, = a,cbd,) + (a,cbd, = a,c\bd) 


= cd, (ab, = a,b) + a,b(cd, = cd) 
= cd,(0) + a,b(0) 


thus, [a, b][c, d] = [a,, bi ][c1, d;]. Also, we must note that if b and d are nonzero, 
then so is bd, since R is an integral domain. 


Definition 9.7. Let R be an integral domain. Then the field of fractions (or field of 
quotients) of R is the field F' constructed above. 


Of course, the fact that F is indeed a field needs proving! 
Theorem 9.14. Let R be an integral domain. Then the field of fractions, F, is a field. 


Proof. The proof of this theorem is not difficult. However, there are many steps to 
complete, as we must verify that F is an abelian group under addition, then that it 
has all of the remaining properties of a ring, and finally that it is a field. We will 
prove a few selected properties, and leave the rest to the reader! as Exercise 9.38. 

Take any a, b,c,d,e, f € R with b, d and f all nonzero. Let us show that the 
addition operation on F is associative. We have 

(La, b] + [e, d]) + le, f] = lad + be, bd] + le, f] 

= [adf + bcf + bde, bdf | 
= [a,b] + {cf +de, df] 
= [a, b] + ([c, d] + [e, f]), 
as required. 

Next, let us prove a distributive law. Observe that 


la, b\([c, d] +e, fl) = Ia, bllef + de, df] = lacf + ade, bdf] 
whereas 
[a, bl[c, d] + [a, b][e, f] = Lac, bd] + [ae, bf] = [abcf + abde, bdf]. 


But as (acf + ade)b*df = (abcf + abde)bdf , these are equal. 

Notice that [0, 1] is the additive identity of F and [1, 1] is the multiplicative 
identity. Let us show that every nonzero element of F has an inverse. Take a nonzero 
[a, b] € F. Note that [0, b] = [0, 1], soa # 0. But then [b, a] € F as well, and 
[a, b][b, a] = [ab, ab] = [1, 1]. Thus, [b, a] = [a, db]. 


' Aren’t you lucky! 
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What is the connection between R and F’? The idea is that F contains a copy of 
R and it is, in a sense, the smallest field that does. 


Theorem 9.15. Let R be an integral domain and F its field of fractions. Then F has 
a subring isomorphic to R. Furthermore, if K is any field having R as a subring, 
then K has a subfield isomorphic to F, and this subfield contains R. 


Proof. Define a : R > F via a(r) = [r, 1], for all r € R. We claim that @ is a 
homomorphism. If 7,, 7. € R, then 


a(n tro) =[n +r, = In, U+[r, 1) =e) + a2) 


and 
a(rir2) = [nire, Y= tn, Ur, 1 = ar e(r2), 


proving the claim. Furthermore, ifr € ker(q), then [r, 1] = [0, 1], and therefore r = 
0. Thus, a is one-to-one, and hence R is isomorphic to a(R) which, by Theorem 9.10, 
is a subring of F. 

Also, define 8 : F + K via B([a, b]) = ab™', foralla, b € Rwithb # 0. (Since 
K isa field, b has an inverse in K. But we still need to check that f is well-defined. 
Suppose that [a, b] = [a, b;]. Then ab, = a,b, and hence ab! = ab, '.) Now, let 
us show that 6 is a homomorphism. But 


B({a, b] + [c, d]) = B({ad + bc, bd}) 
= (ad + be)(bd)"' 
=ab-!+cd7! 
= B((a, b]) + Bile, d]). 


Similarly, 
Bila, bllc, dl) = B(lac, bd]) = ac(bd)~! = ab~'cd7' = B({a, b]) B([c, d]). 


Thus, 6 isahomomorphism. Now, ker(#) is an ideal of F’. By Corollary 9.1, ker(B) = 
{0} or F. But 6([1, 1]) = 1 ¥ 0, and therefore 6 is one-to-one. Thus, K has a subfield 
B(F), which is isomorphic to F’. Also, foranyr € R, wehaver = B([r, 1]) € B(F). 
Thus, R is a subring of the isomorphic copy of the field of fractions of R contained 
in K. 


Example 9.20. As we mentioned above, the field of fractions of Z is Q. 


Example 9.21. Let R = {a+ bJ/2:a,be Z}. Then R is an integral domain, and 
its field of fractions is isomorphic to {a + b./2:a,be Q}. See Exercise 9.36. 


One particular type of isomorphism deserves special mention. 


Definition 9.8. Let R be a ring. Then an automorphism of R is an isomorphism 
from R to R. 
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Example 9.22. For any ring R, the function a : R > R given by a(a) = a for all 
a is an automorphism. 


Example 9.23. The functiona : C > C givenbya(a+bi) = a—bi foralla,be R 
is an automorphism. Indeed, we saw in Example 9.13 that it is ahomomorphism. It is 
immediately obvious that a is one-to-one, and ifa+bi € C, thena(a—bi) = a+bi, 
and therefore a is onto. 


We have something similar to an inner automorphism of a group as well. 


Theorem 9.16. Let R be a ring with identity and u a unit of R. Thena: R — R, 
given by a(a) = u~'au for all a € R, is an automorphism of R. 


Proof. Take a,b € R. Then 
a(a+b) =u '(a+b)u =u 'au+u'bu = a(a) + a(d). 


Also, 

a(ab) = u-'abu = u-'auu'bu = a(a)a(b). 
Thus, a is a homomorphism. If w(a) = 0, then u-'au = O, and therefore a = 
-* ' — yuOu-! = 0. Therefore, a is one-to-one. Finally, take any a € R. 


uu auu 
Then a(uau—!) = u-!uau~'u = a. Thus, a is onto, and the proof is complete. 


Exercises 


9.31. Explain why the following pairs of rings are not isomorphic. 
1. Za ® Z4 and Z4 ®@ Zo @ Zo 

2. Z[x] and 2Z[x] 

9.32. Explain why the following pairs of rings are not isomorphic. 


1. Rand M2(R) 
2. RandR@R 


9.33. Show that if two rings are isomorphic, then their centres are isomorphic. 
9.34. Let R and S be any rings. Show that R @ S is isomorphic to S @ R. 


9.35. Let F be a field. Show that F is isomorphic to its field of fractions by con- 
structing an explicit isomorphism. 


9.36. Let R = {a+ b/2:a,be Z}. Show that R is an integral domain, and that 
its field of fractions is isomorphic to {a + b./2:a,be€ Q}. 


9.37. Let R and S' be integral domains. If the fields of fractions of R and S are 
isomorphic, does it follow that R and S are isomorphic? Prove that it does, or give 
an explicit counterexample. 
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9.38. Complete the proof of Theorem 9.14. 


9.39. Let R be a ring. An involution on R is a function a : R — R such that, 
for all r; € R, we have a(r; + r2) = a(r}) + a(r2), a(rir2) = a(r2)a(r}) and 
a(a(r;)) = r,. Show that the following functions @ are involutions on M)(R). 


ab ac . . 
ae ((: ‘)) ~ 6 5) (called the transpose involution) 


: P d —b . . i 
moe ((: ‘)) _ i r ) (called the symplectic involution) 


9.40. Let R be a ring. Using the definition of an involution from the preceding 
question, 


1. determine under what circumstances an involution on R is an automorphism; and 
2. show that the composition of two involutions on R is an automorphism. 


9.5 Isomorphism Theorems for Rings 


We recall that the three isomorphism theorems for groups were presented in 
Section 4.5. Let us now state the analogues for rings. The first is certainly the most 
important. 


Theorem 9.17 (First Isomorphism Theorem for Rings). Leta : R — S bearing 
homomorphism. Then R/ ker(a) is isomorphic to a(R). 


Proof. Let K = ker(q@). Define 6B : R/K — a(R) via B(a + K) = a(a). From the 
proof of Theorem 4.18, we see that f is an isomorphism of additive groups. Thus, it 
remains only to check that 6 respects multiplication. Take any a, b € R. Then 


B(a+ K)(b+ K)) = Blab + K) = a(ab) = a(aja(b) = Bla+ K)B(b+ K), 


as required. 


Whenever we are asked to show that a ring modulo an ideal is isomorphic to 
some other ring, it is usually a good indication that we should employ the First 
Isomorphism Theorem. 


Example 9.24. We already know that for any n > 2, the additive groups Z/nZ and 
Zn are isomorphic. Indeed, in Example 4.21, we showed that a : Z > Z,, given 
by a(a) = [a], is an onto group homomorphism with kernel nZ. But, in fact, we 
also have a(ab) = [ab] = [a][b] = a(a)a(b), for all a, b € Z. Thus, a is actually 
an onto ring homomorphism, and we now know that Z/nZ and Z,, are isomorphic 
rings. 
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Example 9.25, Let us show that R[x]/(x) is isomorphic to R. To this end, let us 
define a : R[x] > R viaa(f(x)) = f (0). Now, if f(x), g(x) € R[x], then 


a( f(x) + g@)) = FO) + gO) = a(f(x)) + a(g(x)). 


Furthermore, 


a(f(x)g(x)) = f(O)gO) = a(f(x))a(g(x)). 


Thus, @ is a homomorphism. Also, if r € R, then simply regarding r as a constant 
polynomial, we have a(r) = r; hence, a is onto. The kernel of a is the set of 
all polynomials f(x) satisfying f(0) = 0. But f(0) is the constant term of the 
polynomial. Thus, ker(q@) is the set of all polynomials with zero constant term, that 
is, the set of all polynomials that are multiples of x. Now apply Theorem 9.17. 


A couple of rather interesting consequences follow. 


Corollary 9.2. Let R be a ring with identity of characteristic n. If n = 0, then R 
has a subring isomorphic to Z. If n > 2, then R has a subring isomorphic to Zy. 


Proof. By Theorem 9.11, there is a homomorphism a : Z — R with kernel nZ. 
Now, Theorem 9.10 says that w(Z) is a subring of R, and Theorem 9.17 tells us that 
this subring is isomorphic to Z/nZ. Ifn = 0, then nZ = {0}, and there is nothing to 
do. Otherwise, we use Example 9.24. 


Corollary 9.3. Let F be a field. If F has characteristic 0, then F has a subfield 
isomorphic to Q. If F has prime characteristic p, then F has a subfield isomorphic 
to Lp. 


Proof. If char F = p > O, then we use the preceding corollary. If char F = 0, 
then we note that F has a subring isomorphic to Z. By Theorem 9.15, F also has a 
subfield isomorphic to the field of fractions of Z, namely, Q. 


The subfield discussed in Corollary 9.3 (either Q or Z,) is the smallest subfield 
of F, and it is called the prime subfield. 


Theorem 9.18 (Second Isomorphism Theorem for Rings). Let R be a ring with 
ideals I and J. Then I/(I 1 J) is isomorphic to I + J)/J. 


Proof. Definea : I > I+ J)/J viaali) = i+ J, for all i € 7. Consulting 
the proof of Theorem 4.19, we see that a is an onto homomorphism of additive 
groups with kernel J M J. In view of the First Isomorphism Theorem for Rings, 
it suffices to show that a respects multiplication. But for any i,,i2 € J, we have 
a(iji2) = ig + J = Gi + J) (in +: J+) = (i, av (in). The proof is complete. 


Example 9.26. Let R = Z, I = (4) and J = (6). Then the preceding theorem 
tells us that (4)/((4) M (6)) is isomorphic to ((4) + (6))/(6). That is, (4)/(12) is 
isomorphic to (2)/(6). 
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Theorem 9.19 (Third Isomorphism Theorem for Rings). Let R be a ring, and let 
I and J be ideals of R with I © J. Then (R/1I)/(J/1) is isomorphic to R/J. 


Proof. Definea: R/I > R/J viaa(a+ 1) =a+ J, for alla € R. The proof of 
Theorem 4.20 shows us that @ is an onto additive group homomorphism with kernel 
J/I.1t remains only to show that a respects multiplication, for then we can apply 
Theorem 9.17. Take a, b € R. Then 


a((at+I)(b+/])) =a(ab+J)=ab+J=(a+J)(b+ J) =a(at+)a(b+/). 


We are done. 


Exercises 
9.41. Let R and S be rings. Show that (R ® S)/(R @ {0}) is isomorphic to S. 


9.42. Let m and n be positive integers, both greater than 1. Show that the rings 
(Z ® Z)/((m) ® (n)) and Z,, @ Z, are isomorphic. 


9.43. Let I be the set of all polynomials f(x) € Z[x] such that the constant term 
of f(x) is a multiple of 5. Show that Z[x]/J is isomorphic to Zs. 


9.44, Let J be the set of all matrices in M(Z) in which every entry is even. Show 
that M>(Z)/TI is isomorphic to M2(Zz). 


9.45. Show that the rings (3Z/60Z)/(12Z/60Z) and 3Z/12Z are isomorphic. Then 
show that both are isomorphic to Z4. 


9.46. Let J and J be ideals ina ring R such that J + J = R. Show that R/(IN J) 
is isomorphic to (R/T) ® (R/J). 


9.6 Prime and Maximal Ideals 


We conclude this chapter by discussing two special sorts of ideals. 
Definition 9.9. Let R be a ring. An ideal M of R is said to be maximal if 


1. M # R;and 
2. if J is an ideal of R containing M, then J = M or I = R. 


Example 9.27. Let R = Zand let n be a nonnegative integer. Then we claim that (1) 
is a maximal ideal of R if and only if n is prime. Indeed, (0) is certainly not maximal, 
as (0) € (2) € R. Also, (1) is not maximal, since (1) = R. If n is composite, say 
n= kl, with 1 < k,l <n, then we note that (n) € (k) € R, so (n) is not maximal. 
Finally, let n be prime. Suppose that J is an ideal of R with (n) ¢ J © R. Take 
a € I\(n). Since a is not divisible by n, and n is prime, we know that (a,n) = 1. 
Thus, by Corollary 2.1, we can find integers uv and v such that au + nv = 1. But as 
a,n € TI, this implies that | € 7, hence J = R, giving us a contradiction. (As we 
shall see shortly, there is another way to prove this.) 
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Example 9.28. In any field, the ideal {0} is maximal! Remember, by Corollary 9.1, 
a field only has two ideals. 


In a commutative ring with identity, there is a nice test for maximality of ideals. 


Theorem 9.20. Let R be a commutative ring with identity, and M an ideal of R. 
Then M is maximal if and only if R/M is a field. 


Proof. Suppose that M is a maximal ideal. By Theorem 9.7, R/M is a commutative 
ring with identity. Furthermore, as M 4 R, we know that R/M consists of more than 
one additive left coset. But the only ring in which 0 = | is the ring consisting only of 
zero; thus,0+ M 4 1+ M. It remains to show that every nonzero element of R/M 
has an inverse. Leta + M#0+M. Now, define J = {m+ra:meéM,re R}. 
We claim that J is an ideal of R. Taking r = 0, we note that m € J for allm € M; 
thus, M C TJ and, in particular, 0 € 7. Ifm; ¢ M,r; € R, then 


(m, + ra) — (m2 + 12a) = (mM, — m2) + (71 —12)a € I. 


Also, for any s € R, s(m, + r1a) = sm, + srja. As sm, € M and sr; € R, we 
see that J has the absorption property and is, therefore, an ideal. But we noted above 
that M C J. Furthermore, a = 0+ la € I\M. By the maximality of M, we have 
I = R. In particular, 1 € 7, so there exist m € M andr € R such thatm+ra= 1. 
But then (r + M)(a+ M) =1—m+M=1+M,sincem € M. That is,r + M is 
the inverse of a + M, and R/M is a field. 

Conversely, suppose that R/M is a field. We must show that M is maximal. If 
M = R, then R/M consists only of a single additive left coset, contradicting the fact 
that a field must have a distinct 0 and 1. Thus, M ¥ R. Suppose that J is an ideal of 
RwithM CIC R.Takea € I\M.Now,a+M #£0+M,soa+M has an inverse, 
say b+ M. Then (a+M)(b+M) = 1+M;inother words, 1—ab € M C I. Butalso 
a € I, which means that ab € I by absorption, and therefore 1 = (1—ab)+ abe TI. 
By Theorem 9.2, J = R, giving us a contradiction and completing the proof. 


Example 9.29. This gives us another way to deal with Example 9.27. If n = 0, then 
we note that Z/{0} is simply Z, which is not a field. Thus, (0) is not maximal. If 
n = 1, then observe that Z/Z is the ring with one element, which is not a field; 
hence, (1) is not maximal. For any n > 2, we see from Example 9.24 that Z/nZ is 
isomorphic to Z,. But by Theorem 8.11, Z,, is a field if and only if n is prime. Thus, 
(n) is maximal if and only if n is prime. 


Example 9.30. By Example 9.25, R[x]/(x) is isomorphic to R, which is a field. 
Thus, (x) is a maximal ideal of R[x]. 


Example 9.31. In the same manner as Example 9.30, we see that Z[x]/(x) is iso- 
morphic to Z. But Z is not a field, and hence (x) is not maximal. In fact, we can 
see this by noting that (x) is properly contained in the ideal M consisting of those 
polynomials whose constant terms are multiples of 5. We can use Theorem 9.20 to 
show that M is maximal. See Exercise 9.43. 
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It is worth mentioning that Theorem 9.20 only applies when R is a commutative 
ring with identity. For instance, the ideal containing only the zero matrix is maximal 
in M)(R)! See Exercise 9.54. 


Definition 9.10. Let R be a commutative ring and P an ideal of R. Then we say 
that P is a prime ideal’ if 


1. P A R; and 
2. ifa,b € Randab ¢€ P,theneithera €¢ Porbe P. 


Example 9.32. In any integral domain, {0} is a prime ideal. If ab = 0, then a = 0 
orb=0. 


Example 9.33. Let us consider R = Z. By the preceding example, {0} is prime, so 
we know immediately that maximal and prime are not the same thing. Of course, 
(1) = R, so (1) is not prime. Suppose that n > 2. Ifn is composite, say n = k/ with 
1 < k,l <n, we see that kl € (n) but neither k nor / lies in (n). Thus, (7) is not 
prime. But if n is prime, then (7) is a prime ideal. Indeed, if ab € (n), then n|ab. 
Thus, by Theorem 2.7, n|a or n|b, and hence a or b is in (7). 


Once again, there is another way to handle this last example. 


Theorem 9.21. Let R be a commutative ring with identity and P an ideal. Then P 
is prime if and only if R/P. is an integral domain. 


Proof. Suppose that P is prime. Since R is a commutative ring with identity, so 
is R/P, by Theorem 9.7. Also, as P # R, R/P has more than one element, and 
therefore 0 + P #£ 1+ P. Thus, it remains to show that R/P has no zero divisors. 
Suppose that (a + P)(b+ P) =0+ P.Thenab € P,andhencea € Porbe P. 
That is,a+ P=0+ Porb+ P=0+4 P, and R/P is an integral domain. 
Conversely, let R/P be an integral domain. As R/P cannot be the ring with one 
element, P # R. Suppose that ab € P. Then (a+ P)(b+ P) = 0+ P. Since R/P 
has no zero divisors,a + P =0+ Porb+ P=04 P. Thatis,a ¢ Porbe P, 
and P is prime. 


Example 9.34. Let us look at Example 9.33 again. We know that Z/(0) is just Z, 
which is an integral domain, and hence (0) is a prime ideal. If n > 2, then Z/(n) is 
isomorphic to Z,, by Example 9.24, and Theorem 8.11 tells us that this is an integral 
domain if and only if n is prime. Thus, for a nonnegative integer n, (1) is a prime 
ideal of Z if and only if n is 0 or prime. 


Example 9.35. Refer to Example 9.31. We see that (x) is a prime ideal in Z[x], since 
Z{x]/(x) is isomorphic to Z. 


Example 9.36. Naturally, (x) is prime in R[x], because we saw in Example 9.30 
that R[x]/(x) is isomorphic to R, which is a field, hence an integral domain. 


?Please note that for noncommutative rings, the definition of a prime ideal is different. We will only 
concern ourselves with prime ideals in commutative rings in this book. 
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Of course, this last example can be generalized. 


Theorem 9.22. Let R be a commutative ring with identity. Then every maximal ideal 
of R is also a prime ideal. 


Proof. Use the last two theorems and the fact that every field is an integral domain. 


As we have already seen, not every prime ideal is maximal. Also, this last theorem 
only applies to commutative rings with identity. In some commutative rings without 
an identity, it is possible to find maximal ideals that are not prime, and Exercise 9.51 
asks for an example of this phenomenon. 


Exercises 


9.47. Let R be the ring from Exercise 8.17. Is the ideal (2) prime? Is it maximal? 


9.48. Find all prime ideals in each of the following rings. 


1. Zio 
2. Zso 


9.49. Let R be a finite commutative ring with identity. Show that every prime ideal 
of R is maximal. 


9.50. Find every maximal ideal of Z7 @ Z7. 


9.51. Find an example of a commutative ring having an ideal that is maximal but 
not prime. 


9.52. Suppose that R is a commutative ring with identity in which the elements of 
R that are not units form an ideal. Show that this ideal is the unique maximal ideal 
of R. 


9.53. Show that every field has the property described in the preceding exercise. 
Also show that Z,» has this property, for every prime p and positive integer n. 


9.54. Show that the ideal containing only the zero matrix is maximal in M2(R). 


9.55. Let R be acommutative ring with identity having a prime ideal J. Find a prime 
idealin R @ R. 


9.56. Let R A {0} be a commutative ring with identity. Suppose that every proper 
ideal of R is prime. Show that R is an integral domain, and then use this information 
to show that R is, in fact, a field. 


Chapter 10 ®) 
Special Types of Domains ra 


In this chapter, we begin with a specific and rather familiar sort of integral domain, 
and then generalize slightly in each section. First, we define a polynomial ring over 
a field, and show that we have a division algorithm in such a ring. As a result, this 
polynomial ring is a special type of ring called a Euclidean domain. 

Subsequently, we demonstrate that Euclidean domains are principal ideal domains; 
that is, every ideal is principal. Finally, we prove that principal ideal domains are 
examples of unique factorization domains, in which we have something similar to 
the Fundamental Theorem of Arithmetic. 


10.1 Polynomial Rings 


We are certainly familiar with polynomials having real coefficients. There is no 
reason why we cannot consider coefficients in other rings. 


Definition 10.1. Let R be a ring. Then a polynomial with coefficients in R is a 
formal expression 
Ay + a,x + gx” +--+ + ayx”, 


where a; € R and n is a nonnegative integer. Suppose that by + byx +--+ + Dyx™ 
is also a polynomial with coefficients in R. Without loss of generality, let us say that 
n < m. Then these polynomials are equal if and only if a; = b; for all i < n and 
b; = 0 for all i > n. The set of all polynomials with coefficients in R is denoted 
R[x]. 


Example 10.1. Let R = Zs. Then (inserting congruence class brackets for clarity), 
an example of a polynomial in R[x] would be f(x) = [3] + [2]x + [4]x?. As part 
of the above definition, we observe that f(x) = g(x), where g(x) = [3] + [2]x + 
[4]x? + [0]x?. 
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Note that the x in a polynomial is not an element of R. It is simply a placeholder 
in the expression of the polynomial. We could, equally well, define the polynomials 
in terms of sequences of elements of R (with only finitely many terms different from 
zero). But nobody thinks of polynomials in that way. 


Definition 10.2. Let R be a ring and let f(x) = agp + ayx +--+ + a,x" € R[x]. 
Further suppose that a,, 4 0 but a, = 0 for all k > m. Then the degree of f(x) is m, 
and we write deg(f (x)) = m. The leading term of f(x) is a,x’, and the leading 
coefficient is a,,. Note that the zero polynomial, 0, has no degree, leading term or 
leading coefficient. A constant polynomial has degree 0 (or is the zero polynomial). 
If R has an identity, then f(x) is monic if its leading coefficient is 1. 


Example 10.2. In Q{x], let f(x) = 3 + 7x — 15x? + Ox? + 2x* + Ox°. Then 
deg(f(x)) = 4, the leading term is 2x* and the leading coefficient is 2. This poly- 
nomial is not monic. 


We wish to make R[x] into a ring, and so we need addition and multiplication 
operations. These will be exactly the same as for real polynomials. Let f(x) = 
dao tayx +--+ + a,x" and g(x) = bo +bix +--- + byx”. Adding in terms with 
zero coefficients if necessary, we may assume that m = n. Then 


f(x) + g(x) = (ao + bo) + (ay + Bi) + +++ + (Gn + Bn) x". 


Similarly, 
Fx)g (x) = co + ere + 2K? Hee Hemp nx, 


where 
C= dgb; + ayb;_; + anb;_> feet t ajbo. 


Here, we take a; = Oif j > nandb; =Oif j >m. 


Example 10.3. In Z7[x], let f(x) = 5+2x+6x? and g(x) = 3+x+4x?+5x3. Then 
f@+g(x) = 1+3x+3x?+5x3 and fx)g(x) = 1+4x +5x7+4x3+6x4+42x°. 


Theorem 10.1. Jf R is a ring, then so is R[x]. 


Proof. Let us show that R[x] is an abelian group under addition. Clearly the sum of 
two polynomials is a polynomial. Let f(x) = dgt+---+anx”, g(x) = bo+---+byx™ 
and h(x) = co + --- + cyx*. Then the coefficient of x! in f(x) + g(x) isa + bi, 
and similarly for g(x) + f(x) (adding in terms with zero coefficients if necessary). 
Thus, addition is commutative. In the same way, because the addition of coefficients 
is associative, addition in R[x] is associative. The zero polynomial is the additive 
identity, and — f(x) = —ao —--- — a,x". Therefore, R[x] is an abelian group under 
addition. 

Evidently, the product of two polynomials is a polynomial. Let us check a dis- 
tributive law. The coefficient of x! in f (x)(g(x) + h(x)) is 
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ag(Bi + ci) + a1 Bi-1 + ci-1) ++ ++ +4; (bo + Co). 


But this is (agb; +----++a;bo) + (aoc; + --- +. a;c9), which is the coefficient of x! in 
Ff (x)g(x)+ f (x)h(x). The other distributive law is proved similarly. Finally, we must 
check that multiplication is associative. But by repeated application of the distributive 
laws, we see that we may reduce to proving that (a,x"byx")cyx” = ayx" (byx’cywx"), 
for all a,, by, cy € R and all u,v, w > 0. However, both sides of this equation are 
equal to a,byc,,x"*"t", and the proof is complete. 


Corollary 10.1. Let R be a ring. Then 


1. if R has an identity, then so does R[x]; and 
2. if R is commutative, then so is R[x]. 


Proof. (1) The constant polynomial | is the identity. 

(2) Repeatedly applying the distributive laws, we see that we need only check that 
a;x' and bj x commute, where a;,b; € R andi, j > 0. But a;x' bj xi= ajb; xt 
and bjx/aj;x' = bja; ;x'*J, Since R is commutative, these are equal. 


When our ring is an integral domain, degrees of polynomials behave in a way we 
would expect. 


Theorem 10.2. a R be an integral domain, and let f(x) and g(x) be nonzero 
polynomials in R[x], of degree m and n respectively. Then 


I. deg(f (x) + g(x)) is at most the larger of m and n (or f (x) + g(x) = 0); and 
2. deg(f(x)g(x)) =m +n. 


Proof. (1) This is clear from the definition of polynomial addition. 

(2) Let f(x) = ag +--+ + ayx™ and g(x) = bo +---+ b,x". Then we see 
from the definition of polynomial multiplication that the only term of highest degree 
in f(x)g(x) is Gnbyx™*". Furthermore, a, 4 0 4 b, and, since R is an integral 
domain, a,b, 4 0. Thus, deg(f (x)g(x)) =m+n. 


Note that the second part of the theorem fails if R is not an integral domain. 
For instance, in Ze[x], we have (2 + 3x)(1 + 2x) = 2+ x, which does not have 
degree 2. 


Corollary 10.2. Jf R is an integral domain, then so is R[x]. 


Proof. By Corollary 10.1, R[x] is a commutative ring with identity. Furthermore, 
1 £0. By the preceding theorem, the product of nonzero polynomials cannot be the 
zero polynomial. 


Why are we so interested in polynomial rings? We now know that if F is a field, 
then F'[x] is an integral domain. But it has another attractive property. Indeed, we 
have an analogue of the division algorithm with which we are familiar for the integers. 
Readers who have seen polynomial long division for real polynomials will find the 
procedure very similar. 
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Theorem 10.3. (Division Algorithm for Polynomials). Let F be a field, and let 
f(), g(x) € F[x], with g(x) 4 0. Then there exist unique q(x), r(x) € F[x] such 
that 


f(x) = g@)q(x) + r(x), 
with either r(x) = 0 or deg(r(x)) < deg(g(x)). 


Proof. Let us verify the existence of g(x) and r(x). If f(x) = 0, there is noth- 
ing to do; indeed, we let g(x) = r(x) = O. Therefore, assume that f(x) is not 
the zero polynomial. We proceed by strong induction on deg(f(x)). Suppose that 
deg(f(x)) = 0. If deg(g(x)) > 0, then use g(x) = O andr(x) = f(x). On the other 
hand, if deg(g(x)) = 0, then g(x) = Dis a nonzero constant in F. As F is a field, 
we have b~! € F, and we can use g(x) = b7! f(x) and r(x) = 0. 

Thus, suppose that deg( f(x)) =n > 0 and that our result holds for polynomials 
of smaller degree. Let us write f(x) = do + a;x +---+ a,x". Also suppose that 
deg(g(x)) = m, and write g(x) = bo +Dix+---+b),x”.Ifn < m, then we can use 
q(x) = O and r(x) = f(x). Otherwise, notice that in f(x) — ety, aa"; no 
term of degree greater than n appears, and the coefficient of x” is dy — bmb,, la, =0; 
thus, either f(x) — g(x)b;,!a,x"~™ is the zero polynomial, or it has degree strictly 
smaller than f(x). By our inductive hypothesis, there exist g(x), r(x) € F[x] such 
that f(x) — eb, ax = g(x)q(x) + r(x), with r(x) = 0 or deg(r(x)) < 
deg(g(x)). But then f(x) = g(x)(q(x) + al am + r(x), as required. 

Now for uniqueness. Suppose that f(x) = g(x)q(x) +r(x) = g(x)qi(x) +r), 
with g(x), gi(x), r(x), (x) € F[x] and each of r(x) and r; (x) either is 0 or has 
degree smaller than that of g(x). Then g(x)(¢(x) — qi(x)) = 71(x) — r(x). Suppose 
that g(x) 4 qi(x). By Theorem 10.2, deg(g(x) (q(x) — qi(x))) = deg(g(x)), but 
r(x) — r(x) cannot possibly have a degree that large. Thus, g(x) = q,(x) and hence 


r(x) =ri(). 


The proof also shows us how to construct g(x) and r(x). We look only at the 
leading terms of f(x) and g(x) (say, respectively, a,x” and b,x”). Assuming that 
n > m, we subtract b>'a,x"~" g(x) from f (x) and obtain either the zero polynomial 
or a polynomial of degree smaller than deg( f (x)). Then repeat. 


Example 10.4. Letus apply the division algorithm in Q[x] with f(x) = 8x+—4x3+ 
2x? +x+1 and g(x) = 2x? +3x +7. We take 27! -8x4-? = 4x”, multiply by g(x) 
and subtract from f(x). 


4x2 
2x* + 3x +7) 8x4 —4x3 4227 +2x4+1 
— 8x4 — 12x3 — 28x? 
— 16x? — 26x? +x 


Next, take 2~'(—16)x?-? = —8x, multiply by g(x) and subtract. 
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4x? — 8x 
2x* + 3x +7) 8x4 —4x3 42x27 +2x4+1 
— 8x4 — 12x3 — 28x? 
— 16x? —26x2 +x 
16x? + 24x? + 56x 


oe" 4 5741 


Finally, take 2~!(—2)x?-? = —1, multiply by g(x) and subtract. 


4x? —8x-—1 
2x* + 3x +7) 8x* —4x3 4+2x2 4x41 
— 8x4 — 12x3 — 28x? 
— 16x23 — 26x? +x 
16x? + 24x? + 56x 


— 2x2 +57x +1 
2x? +3x+7 


60x + 8 


We now have a remainder with degree smaller than deg(g(x)), so we are done. 
Indeed, f(x) = g(x)(4x? — 8x — 1) + (60x + 8). 


Note that it is not sufficient to work in R[x], where R is an integral domain. 
By Corollary 10.2, R[x] is also an integral domain, but we cannot implement the 
division algorithm if we are unable to take the inverse of the leading coefficient of 
g(x). Indeed, if we worked in Z[x], we would be immediately stymied if we tried to 
perform the division algorithm using f(x) = 2x* + 3x + 5 and g(x) = 3x+7. 

In fact, a polynomial ring over a field is a nice example of a special type of integral 
domain that we can now discuss. 


Exercises 
10.1. In Zy,[x], let f(x) = 2x3 + 4x? + 2x +5 and g(x) = 2x4 + 5x9 + 7x +1. 
Find f(x) — g(x) and f(x)g(x). 


10.2, Let f(x) = 3x? +x44+.x° + 3x7 + 2x +4 and g(x) = 2x7 + 3x7 +2441 
be polynomials in Zs[x]. Find g(x), r(x) € Zs[x], with deg(r(x)) < 3, such that 
f(x) = g(x)q(x) +r). 


10.3. Let f(x) = 3x9 + 6x4 + x3 + 3x? + 2x +4 and g(x) = 2x7 +3x7 +x41 
be polynomials in Z7[x]. Find g(x), r(x) € Z7[x], with deg(r(x)) < 3, such that 
f(x) = g(x)q(x) +r). 


10.4. Let R be an integral domain. Show that the units of R[x] are precisely the 
constant polynomials a, where a € U(R). 


10.5. If F isa field, is F[x] a field? 
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10.6. Show that 2x + 1 is a unit in Z4[x]. Then, for any prime p, find a unit in 
Z p2[x] that is not a constant polynomial. 


10.7. For any ring R, show that R and R[x] have the same characteristic. 
10.8. If R and S are isomorphic rings, show that R[x] and S[x] are also isomorphic. 


10.9. Let S be a subring of R. Show that S[x] is a subring of R[x]. In particular, if 
S is an ideal of R, show that S[x] is an ideal of R[x]. 


10.10. Let R be a commutative ring with identity and P a prime ideal of R. Show 
that P[x] is a prime ideal of R[x]. 


10.2 Euclidean Domains 


A Euclidean domain is an integral domain having an additional property. 


Definition 10.3. Let R be an integral domain. Then a Euclidean function is a func- 
tion ¢ from the set of nonzero elements of R to the nonnegative integers such that, 
for all nonzero a, b € R, we have 


1. e(a) < e(ab); and 
2. there exist g,r € R such that a = bg +r, and either r = 0 or e(r) < e(b). 


Definition 10.4. A Euclidean domain is an integral domain having a Euclidean 
function. 


We have already seen several examples of Euclidean domains. 


Example 10.5. The integers form a Euclidean domain. We already know that Z 
is an integral domain. Define e(a) = |a|. If a and b are nonzero integers, then 
|ab| = |a||b| = |a|. Furthermore, by the division algorithm, there exist g,r € Z 
such that a = |b|g +r, withO <r < |b|. If b > 0, we are done. Otherwise, simply 
note that a = b(—q) +r. 


Example 10.6. Any field is a Euclidean domain. See Exercise 10.12. 


Example 10.7. If F is a field, then F'[x] is a Euclidean domain. Indeed, Corol- 
lary 10.2 tells us that it is an integral domain. For any 0 # f(x) € F[x], let 
e(f (x)) = deg(f(x)). If0 4 g(x) € F[x], then by Theorem 10.2, deg( f(x) g(x)) = 
deg(f (x)) + deg(g(x)) => deg(f (x)). The division algorithm for polynomials com- 
pletes the proof. 


Let us construct a new Euclidean domain. 
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Example 10.8. Let R = {a+ bi : a,b € Z}. We call this the ring of Gaussian 
integers. By Exercises 8.17 and 8.27, R is a subring of F = {a+ bi:a,beQ} 
which, in turn, is a subfield of C. We claim that R is a Euclidean domain. It is surely 
an integral domain, since it is a unital subring of a field and therefore has no zero 
divisors. It remains to construct a Euclidean function. 

Define e¢ : F > Qviae(a+ bi) = a* +b. In particular, if0O Aa+bi € R, 
then e(a+ bi) € N. Ifa, b,c,d € Q, then 


é((a + bi)(c + di)) = e((ac — bd) + (ad + bc)i) 
= (ac — bd)” + (ad + bc)’ 
=~ Pe 4 Pe 4 e+ Pe 
= (a? +b’)(c* +d’) 
= (a+ bi)e(c+ di). 


In particular, if a + bi and c + di are nonzero elements of R, then 


é(a+ bi) < e((a+ bi)(c+di)). 


Take any nonzero u,v € R. Then as F is a field, uv—! € F. Let us write uv! 


s+ti, with s, t € Q. Choose integers m and n such that |s —m| < 5 and lt—n| < 5. 
Then 
u—v(m+ni) =u—v((s + ti) + (m—s)+ (n—t)i)) 


=u —v(uv')+v((s —m) + (t —n)i) 
= v((s —m)+ (t —n)i). 


Now, 
e((s—m) + (=m) = (6m) + tn) s 5. 
Therefore, 
é(u — vim +n)i) = e(v)e((s — m) + (t —n)i) < ev). 
Letting g = m+ni andr = u — v(m + ni), we have u = vq +r and we are done. 
What is so special about Euclidean domains? Let us begin with some definitions. 


Definition 10.5. Let R be a commutative ring with identity. If a,b € R, then we 
say that a divides b, and write a|b, if there exists ac € R such that b = ac. 


Of course, this agrees with our definition of divisibility in Z. We are very much 
interested in extending the notion of a greatest common divisor as well. For an 
arbitrary ring, this is problematic, as there is no particular notion of ordering. But 
for a Euclidean domain, we have ¢! 
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Definition 10.6. Let R be a Euclidean domain, and let a, b € R, not both zero. Then 
a nonzero element d of R is said to be a greatest common divisor (or gcd) if 


1. dla and d|b; and 
2. whenever c is an element of R satisfying cla and c|b, we have e(c) < e(d). 


Certainly a gcd must exist. Indeed, | is a common divisor of any two elements, so 
the set of common divisors is not empty. Furthermore, by definition of a Euclidean 
function, if cla and a # 0, then e(c) < e(a). Thus, there is an upper bound on the 
€ values of the common divisors, so we can select one having the largest possible 
value. 

Notice that we called d “a gcd’, not “the gcd”. Indeed, this definition does not 
produce a unique gcd. In particular, in Z, we see that both 5 and —5 would meet the 
description of “a gcd” of 10 and 35. However, when we say “the gcd”, we will still 
mean the positive one; that is, (10, 35) = 5, not —5. 

Similarly, if F is a field, suppose that d(x) is a gcd of f(x) and g(x). Ifuisa 
nonzero element of F, we see immediately that ud(x) also divides both f(x) and 
g(x), and that deg(ud(x)) = deg(d(x)). Thus, ud(x) is also a gcd. But again, we 
can choose a specific gcd here. 


Definition 10.7. Let F be a field and let f(x) and g(x) be polynomials in F'[x], not 
both the zero polynomial. By the ged of f(x) and g(x) we mean a monic gcd. When 
we write (f(x), g(x)), we mean specifically this monic gcd. 


For more general Euclidean domains, we cannot easily single out a particular gcd 
in this manner. But we will see that, in fact, the gcds are all related to each other in 
a nice way. While proving this, we can produce some other interesting results. For 
instance, the Euclidean domain is so named because there is a Euclidean algorithm 
just like in Z. 


Theorem 10.4 (Euclidean Algorithm for Euclidean Domains). Let R be a 
Euclidean domain. Take a,b € R with b & 0. If bla, then b is a gcd of a and 
b. Otherwise, apply the division algorithm repeatedly. To wit, write 


a=bq+nr 
b=ngtr 


ry =193 +13 


Te-2 =Vk-19k + 1k 
rk-1 = regke+1 +9, 


where all q;,r; € Randr; € 0, with e(r|) < e(b) and e(r;) < e(rj_-1) forall j = 2. 
Then r, is a gcd of a and b. 
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Proof. If b\a, then b is a common divisor of a and b. Also, if c|b, then e(c) < e(b), 
and so b is a gcd. Assume that b does not divide a, and we perform the division 
algorithm repeatedly, as indicated. 

Note, first of all, that this process must end, as the e(r;) are strictly decreasing 
integers and cannot be negative. Suppose that cla and c|b, say a = ca, and b = cay, 
with a; € R.Thenr; = c(a; — aq), and hence c|r;. Similarly, any common divisor 
of b andr; must also divide a. Thus, the set of common divisors of a and b is precisely 
the same as the set of common divisors of b and r,. In particular, they have the same 
set of gcds. 

By the same argument, the gcds of b and r; are the same as those of r; and r2. We 
then repeat this and find that the gcds of a and b are the same as the gcds of r;, and 
0. But as everything divides 0, we are looking only for the largest value of ¢ among 
the divisors of r;. However, as if u|v and v 4 0, then e(u) < e(v), we see that r; is 
a gcd of rz and 0, as required. 


Corollary 10.3. Let R be a Euclidean domain. Take a, b € R with b 4 0. Let d be 
the gcd of a and b found in the preceding theorem. Then there exist u,v € R such 
that d = au + bv. 


Proof. If bla, then d = b = a(O) + b(1). Assume otherwise. We have d = ry, = 
rp-2 + e-1(—qx), a multiple of rz. plus a multiple of r,_;. But the preceding 
equation is r,_3 = Ig_29¢—-1 +11. Thus, 


d =rg_2 + (re-3 + re-2(—Gk-1)) (— 4x). 


We have written d as a multiple of 7,_3 plus a multiple of r,_2. Now move backwards 
through the equations, and we will eventually write d as a multiple of a plus a multiple 
of b. 


Example 10.9. Let us apply the Euclidean algorithm and its corollary in Z7[x], 
starting with f(x) = 2x3 + 4x? +x +1 and g(x) = 6x? + 4x? + 4x +5. We write 


2x? + 4x7 4x +41 = (6x? + 4x? + 4x +:5)(5) + (5x7 + 2x +4) 
6x? + 4x? + 4x +5 = (5x7 4+ 2x +4)(4x +2) 4 (5x +4) 
5x7 +2x+4= (5x +4)(x4+1) +0. 


Thus, 5x + 4 is a ged of f(x) and g(x). Now let us apply the method discussed in 
the proof of the preceding corollary. We have 


5x +4 = g(x) — (5x? + 2x + 4)(4x + 2) 
= g(x) — (f(x) — g(x) (5)) (4x + 2) 
= f(x)(3x +5) + g(x) (6x + 4). 


If we want to use (f (x), g(x)), we must make it monic. Now, 5~! = 3, and therefore 
(f(x), g(x)) = 3(5x + 4) = x +5. Then we get 


180 10 Special Types of Domains 
x+5 = 3(5x +4) = f(x)(2Qx + 1) 4+ g(x) (4x +5). 
Corollary 10.4. Let R be a Euclidean domain, and let a,b € R with b £ 0. Let d 


be the gcd of a and b found in Theorem 10.4. Then if c € R is a divisor of both a 
and b, then c\d. 


Proof. We have d = au + by, for some u, v € R. If cla and c|b, then cld. 


Let us now discuss how the gcds of two elements of a Euclidean domain relate to 
each other. 


Definition 10.8. Let R be a commutative ring with identity. If a,b € R, then we 
say that a and b are associates if there exists a unit u of R such that b = au. 


Note that if b = au, where u is a unit, then a = bu-!. Thus, if a is an associate 
of b, then b is an associate of a. 


Example 10.10. In Z, the only units are 1 and —1, so the only associates of a are a 
and —a. 


Example 10.11. Let F be a field. The units in F[x] are the nonzero constants. (See 
Exercise 10.4.) Thus, the associates of f (x) are of the formaf (x), whereO 4a eé F. 


Lemma 10.1. Let R be an integral domain. Then a and b are associates in R if and 
only if a\b and bla. 


Proof. If a and b are associates, the fact that a|b and b|a follows from the definition. 
Suppose that a|b and bla, say b = ar anda = bs, withr, s € R. Thena = ars. If 
a = 0, then b = 0, soa = b- 1. Otherwise, by cancellation, rs = 1, and hence r is 
a unit. 


Theorem 10.5. Let R be a Euclidean domain. Take a,b € R, not both 0. Let d 
be any gcd of a and b. Then c € R is a gcd of a and b if and only if c and d are 
associates. 


Proof. Suppose that c is a gcd of a and b. Let g be the gcd of a and b found in 
Theorem 10.4. By Corollary 10.4, c and d divide g. Applying the division algorithm, 
we have c = gq +r, where g,r € R and either r = 0 or e(r) < e(g). Suppose the 
latter. Now, c|g, and therefore c|r = c — gq. But then e(c) < e(r). However, if c 
and g are both gcds, we must have ¢(c) = €(g), giving us a contradiction. Therefore, 
r = O and g|c. By the preceding lemma, g and c are associates, say c = gu, with u 
aunit in R. By the same argument, d = gv, where v is a unit in R. Thenc = duv"|, 
where uv~! is a unit in R, and hence c and d are associates. 

Conversely, let c and d be associates. Then since d|a and d|b, we have cla and c|b 
as well. Furthermore, since c|d and d|c we can only have e(c) = e(d). Therefore, 
is a gcd of a and b. 


9 
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We can now feel better about Definition 10.7, where we referred to “the” monic 
gcd of f(x) and g(x) in F[x]. As any two gcds are associates, and the only units are 
nonzero elements of F, there can only be one gcd that is monic. 

Time to tidy up! We can strengthen Corollary 10.3. It actually applies to any gcd, 
not just the one found in Theorem 10.4. 


Theorem 10.6. Let R be a Euclidean domain. Take a,b € R, not both 0. Let d be 
a gcd of a and b. Then there exist u,v € R such that d = au + by. 


Proof. Without loss of generality, assume that b ¢ 0, and calculate the gcd g of a 
and b from Theorem 10.4. Then by Corollary 10.3, g = au+ bv, for some u,v € R. 
But by Theorem 10.5, d = gw, for some unit w of R. Thus, d = auw + bvw. 


We conclude by strengthening Corollary 10.4. 


Theorem 10.7. Let R be a Euclidean domain. Take a,b € R, not both 0. Then the 
following are equivalent for an element d of R: 


I. disagcd ofaandb; and 
2. dla, d\b, and if cla and c\|b, then c\d. 


Proof. Suppose (1) holds. Without loss of generality, assume that b 4 0. By def- 
inition, d|a and d|b. Suppose that cla and c|b. If g is the gcd of a and b found in 
Theorem 10.4, then by Corollary 10.4, c|g. But Theorem 10.5 tells us that g|d. Thus, 
cld. 

Conversely, suppose that (2) holds. Then d is a common divisor of a and b. 
Suppose that c is another common divisor of a and b. Then by assumption, c|d. But 
this means that e(c) < e(d); hence, d is a gcd. 


A nice feature of Theorem 10.7 is that it shows that gcds in a Euclidean domain 
do not depend upon the particular Euclidean function that is used. 


Exercises 


10.11. In an integral domain, if a and ab are associates, show that a = 0 or bisa 
unit. 


10.12. Show that every field is a Euclidean domain. 


10.13. Let R be a Euclidean domain. Let n be the smallest value of e(s), for all 
0 #:s € R. Show that for each 0 4 a € R we have e(a) = n if and only ifa isa 
unit. 


10.14. Find all units in the ring of Gaussian integers. 


10.15. In Q[x], let f(x) = 3x4 + 7x3 + 13x? + 7x + 6 and g(x) = 2x4 + 7x3 + 
13x? + 11x +3. Find (f(x), g(x)). 


10.16. In Zs[x], let f(x) = 3x4 + 3x? +x 4+ 1 and g(x) = 2x3 +. 4x? 4x41. 
Find (f (x), g(x)). 
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10.17. Taking f(x) and g(x) as in Exercise 10.15, find u(x), v(x) € Q[x] such that 
(f(x), 8(x)) = f(x)u(x) + gx)v(a). 


10.18. Taking f(x) and g(x) as in Exercise 10.16, find u(x), v(x) € Zs[x] such 
that (f(x), g(x) = fu) + g(x)v@). 


10.19. Find a ged for 5 + 7i and 1 + 3i in the ring of Gaussian integers. 


10.20. Let R be a Euclidean domain having the following additional property: for 
every a,b € R such that a, b and a + b are all nonzero, e(a + b) is no bigger 
than the larger of e(a) and e(b). (For example, if F is a field, the degree function 
on F[x]\{0} has this property.) Show that in the second part of the definition of a 
Euclidean function, the elements g and r are uniquely determined. 


10.3. Principal Ideal Domains 


Let us discuss another sort of integral domain with a nice property. 


Definition 10.9. A principal ideal domain (or PID) is an integral domain in which 
every ideal is principal. 


A field F is an obvious example of a PID; indeed, its only ideals are (0) and 
F = (1). But we can obtain others through the following theorem. 


Theorem 10.8. Every Euclidean domain is a PID. 


Proof. Let R be a Euclidean domain with Euclidean function e, and J an ideal of R. 
If J = {0}, then J = (0), and there is nothing to do. Assume that J 4 {0}. Among the 
nonzero elements of J, choose b so that ¢(b) is as small as possible. (Since ¢ takes on 
values that are nonnegative integers, there must be a smallest such value.) We claim 
that J = (b). Take a € J. As € is a Euclidean function, we have a = bq + r, where 
q,r € Randeitherr = Oore(r) < e(b). Ifr = 0, then bla, as required. Otherwise, 
we note that a, b € I, and since J is an ideal, r = a — bq € I. But by the minimality 
of e(b), this is impossible. 


Example 10.12. Since Z@ is a Euclidean domain, it is a PID. 
Example 10.13. Let F be a field. Since F'[x] is a Euclidean domain, it is a PID. 


Proving that an integral domain is not a Euclidean domain can be a bit tricky; 
it is often simpler to show that is not a PID, from which it follows that it is not a 
Euclidean domain. 


Example 10.14. We claim that Z[x] is not a PID, and hence not a Euclidean domain. 
To prove this, consider the set J of all f(x) € Z[x] whose constant terms are divisible 
by 5. We saw in Exercise 9.2 that J is an ideal. But it is not principal. Indeed, suppose 
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that J = (f(x)). Then as the constant polynomial 5 is in 7, we see that f(x)|5. In 
view of Theorem 10.2, f(x) is a constant polynomial. As it divides 5, the constant 
must be in { 5}. However, (1) = (—1) = Z[x], whereas (5) = (—5) = 5Z[x], 
which does not include x. But x € J, and therefore 5Z[x] 4 I. 


— 


We might, at this point, ask if every PID is a Euclidean domain. The answer is 
no, but this is not obvious. Theodore S. Motzkin showed that there is a subring of 
the complex numbers that is a PID but not a Euclidean domain. We will not use this 
fact, but the interested reader can find an accessible proof in the paper of Wilson [1]. 

Let us explore a couple of other properties of PIDs. The following theorem shows 
that a PID has the ascending chain condition. 


Theorem 10.9. Let R be a PID. Suppose that R has ideals I, k € N, such that 
I, Clb CI, C.---. Then there exists a positive integer n such that I, = I, for all 
k>n. 


Proof. Let I = (Jp, Ix. We claim that J is an ideal. Certainly 0 € 7, C J. If 
a,b € T, then there exist positive integers k and/ such thata € i andb € Jj. Let 
m be the larger of k and/. Then a,b € I,, and hence a — b € I, C J. Similarly, if 
aeél,sayaeé k,andr € R, thenra € i C I. Thus, J is an ideal. As R is a PID, 
we must have J = (c) for some c € J. But thenc ¢€ [,, for some positive integer 
n. It now follows that J = (c) C J,. That is, 7 = J,, and hence J, = [, for all 
k>n. 


We are familiar with the notion of a prime positive integer. Let us extend the idea. 


Definition 10.10. Let R be an integral domain. Then an element p of R is prime if 
it is not zero, not a unit, and if p|ab, with a, b € R, then pla or p|b. 


We observe that the definition of a prime positive integer that we introduced in 
Chapter 2 is different. However, Theorem 2.7 assures us that the definitions are 
equivalent, for positive integers. Of course, the positive integers do not form a ring, 
so in Z, we see that the primes are +2, +3, +5, .... (Note that | and —1 are units, 
so we exclude them.) 

We have an easy lemma. 


Lemma 10.2. Let R be an integral domain, and take 0 4 p € R. Then p is prime 
if and only if (p) is a prime ideal. 


Proof. Let p be prime. If (p) = R, then there exists anr € R such that rp = 1; 
hence, p is a unit. But primes cannot be units, so this is impossible. If ab € (p), 
then p|ab, and hence p|a or p|b. Thus, a € (p) or b € (p), and (p) is a prime ideal. 
Conversely, suppose that (p) is a prime ideal and p|ab. Then ab ¢€ (p), and hence 
a € (p) orb € (p). That is, p|a or p|b. Furthermore, if p is a unit, then by Theorem 
9.2, (p) = R, which contradicts the assumption that (p) is a prime ideal. Thus, p is 
prime. 
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Definition 10.11. Let R be an integral domain, and take p € R. We say that p is 
irreducible if it is not zero, not a unit, and if p = ab, with a, b € R, then either a 
or b must be a unit. 


This is, essentially, the definition we used for a prime positive integer. As we noted 
above, in the integers, these concepts are equivalent. What is the general situation? 


Theorem 10.10. Let R be an integral domain. Then every prime in R is irreducible. 


Proof. Let p be a prime, and suppose that p = ab, with a,b € R. Then p|ab, so 
P\a or p\|b. Without loss of generality, say p|a. But a|p as well. By Lemma 10.1, p 
and a are associates. Thus, by Exercise 10.11, b is a unit, as required. 


Unfortunately, the converse is not true in general. 


Example 10.15. Let R = {a+ bV/5i : a,b € Z}. It is easy to check that R is a 
unital subring of C, and hence an integral domain. We can define a function called 
anorm on R via N(a + b/5i) =a’ +5b?. If u,v € R, then N(uv) = N(u)N(v). 
(This is the same calculation as in Example 10.8.) We claim that 3 is irreducible in 
R. If 3 = uv, then 9 = N(3) = N(u)N(). Noting that the norms of elements of 
R are nonnegative integers, we can only have N(u) = N(v) = 3 or, without loss 
of generality, N(u) = 1 and N(v) = 9. But the equation a* + 5b* = 3 has no 
solution in the integers, so N(u) = N(v) = 3 is impossible. Also, the only solutions 
toa? +5b* = larea € {1, —1} and b = 0. However, 1 and —1 are units in R. Also, 
3 is clearly not a unit, and the claim is proved. Nevertheless, 3 is not prime. To see 
this, we note that (2 + /5i)(2 — /5i) = 9. Of course, 3|9, but 3 does not divide 
244/51 ord 4/51, 


The good news, however, is that in a PID, primeness and irreducibility are equiv- 
alent. 


Theorem 10.11. Let R be a PID and p € R. Then p is prime if and only if p is 
irreducible. 


Proof. In view of Theorem 10.10, we only need to show the converse. Let p be 
irreducible, and let J = (p). We claim that J is a maximal ideal of R. If not, suppose 
that J is an ideal of R with J C J C R. Since R is a PID, we have J = (a), for 
some a € J. Now, p € I C J, so p = ab, for some b € R. As p is irreducible, 
either a or b is a unit. If a is a unit, then by Theorem 9.2, J = R, which is not 
permitted. Therefore, b is a unit. But then a = pb7! e€ I. Thus, J C J, which is 
also not allowed. On the other hand, if J = R, then p is a unit, which is impossible. 
Our claim is proved. 

By Theorem 9.22, a maximal ideal is necessarily prime. Lemma 10.2 completes 
the proof. 
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Exercises 


10.21. With R as in Example 10.15, show that 1 + 2,/5i is irreducible, but not 
prime. 


10.22. Let S be {a + bV/3i : a,b € Z}, a subring of C. Show that 1 + /3i is 
irreducible, but not prime. 


10.23. Show that R and S from the preceding two exercises are not PIDs. 


10.24. Let R be an integral domain. Show that an associate of an irreducible element 
is irreducible, and an associate of a prime element is prime. 


10.25. If R is a Euclidean domain, does it follow that R[x] is a Euclidean domain? 
Prove that it does, or give an explicit counterexample. 


10.26. Let R be a PID. Show that every proper ideal of R is a subset of a maximal 
ideal of R. 


10.27. Let R be an integral domain and pa prime in R. If p|a;a2--- ay, witha; € R, 
show that some aq; is divisible by p. 


10.28. Let R be a PID and 0 £a € R. Show that a is irreducible if and only if (a) 
is a maximal ideal. 


10.29. Let R be an integral domain, but not a field. Show that there exist infinitely 
many ideals [,, J2,... of R such that /,,; is a proper subset of /,, for all n. 


10.30. Let R be an integral domain. If R[x] is a PID, show that R is a field. 


10.4 Unique Factorization Domains 


We now reach our main conclusion, which is that every PID has an analogue of the 
Fundamental Theorem of Arithmetic. 


Definition 10.12. Let R be an integral domain. We say that R is a unique factor- 
ization domain (or UFD) if 


1. every nonzero, nonunit element of R can be written as a product of one or more 
irreducibles; and 

2. the product is unique up to order and associates; that is, if pjpo--- per = 
4192°°:q, for some irreducibles p; and q;, then k = / and, after rearranging, 
each p; is an associate of q;. 


Theorem 10.12. Every PID is a UFD. 
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Proof. Let R be a PID. We shall prove that R satisfies the first part of the definition 
of a UFD. Take any nonzero nonunit a; € R, and suppose that a, is not a product of 
irreducibles. If a; is irreducible then we have an immediate contradiction. Therefore, 
we may write a; = dbz, where az and bo are nonunits in R. If az and b2 are both 
products of irreducibles then again, we have a contradiction, as a; is then a product 
of irreducibles. Without loss of generality, let us say that aj is not a product of 
irreducibles. In particular, itis not irreducible, so write ag = a3b3, where a3 and b3 are 
nonunits, and so forth. Then we have a;+,|a; for all positive integers i. Furthermore, 
as bj41 iS not a unit, we see that a; and a;+; are not associates. By Lemma 10.1, a; 
does not divide a;+;. In particular, a; € (aj+1), but aj41 €¢ (aj), So (ai) | (ai41) 
for all positive integers i. But this contradicts Theorem 10.9, and we see that each 
nonzero nonunit is a product of irreducibles. 

Now let us verify the uniqueness. Suppose that p;--- py = 91 ---q, Where the 
pi and q; are irreducible, and k < 1. Then p;|q:---q. By Theorem 10.11, p; is 
prime. Thus, p; divides one of the terms in the product. After rearranging, we may 
assume that p;|qi. Let us write gj = piu, with u; € R. As qj is irreducible 
and p, is not a unit, we see that uw; is a unit, and hence p, and qj are associates. 
Thus, pi po--: Pk = “1 pig2-:- qi. Cancelling, we have p2--- py = uig2--- qi. 
Now, p2|“1q2--- gq), and since pp is prime, it divides a term in the product. Since a 
divisor of a unit is a unit, we cannot have p2|u;, and therefore p2|g;, for some i > 2. 
Rearranging, we have p2|q2. Just as before, we see that gz = p2u2, for some unit 
uz. Repeating, we find that p; and gq; are associates, 1 <i < k. Ifk = 1, we are 
done. Otherwise, we have 1 = uw; ---uggx41--- qi. But nonunits cannot divide 1, so 
we have a contradiction. 


Our examples of UFDs will largely be PIDs. 


Example 10.16. As we already knew from the Fundamental Theorem of Arithmetic, 
Z is a UFD. 


Example 10.17. For any field F, F[x] is a UFD. 


There are also UFDs that are not PIDs. In fact, Z[x] is such a ring. We opt to 
postpone the proof of this until Section 11.2. 

What sort of integral domains are not UFDs? Either of the two conditions could 
fail. Let us first consider one where nonzero nonunit elements are not necessarily 
products of irreducibles. 


Example 10.18. Let R be the subset of Q[x] consisting of all polynomials with an 
integer constant term. It is easy to see that R is a unital subring of Q[x]. As Q[x] 
is an integral domain, so is R. We claim that the only units of R are the constant 
polynomials | and —1. Indeed, a unit is necessarily a unit in Q[x] as well. By Exercise 
10.4, our unit is a nonzero constant a. But as the constant term of an element of R 
must be an integer, we see that ifaf (x) = 1, thena can only be £1, proving the claim. 
In particular, x is a nonzero nonunit. If we write x = p,(x)--- px(x), a product of 
irreducibles, then all but one of the p;(x) (say pi(x)) are integers and p(x) = qx, 
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for some 0 £ q € Q. But qx is not irreducible; indeed, gx = 2 (fx), and neither 2 
nor 4x is a unit. Thus, x is not a product of irreducibles, and R is not a UFD. 


Even if every nonzero nonunit is a product of irreducibles, this product may not 
be unique. 


Example 10.19. Consider the ring R = {a+b/5i : a,b € Z} from Example 10.15. 
We noted in that example that 3 is irreducible. Applying a similar argument, we can 
see that 2+ /5i and 2 — /5i are irreducible. (We do have to check that they are not 
units, but if uv = 1, then N(u)N(v) = N(1) = 1, and as we noted in Example 10.15, 
this must mean that w = v = £1.) Thus, we can write 9 = 3-3 = (2+-/5i)(2—V/5i), 
giving two different products of irreducibles. As the only units are £1, we see that 3 
is not an associate of 2+ /5i or 2— \/5i. Therefore, our factorization is not unique. 


We close with a few remarks concerning divisibility in a UFD. 


Theorem 10.13. Let R be a UFD, and let a and b be nonzero nonunit elements 
of R. Then there exist irreducibles p,,..., Py, none of which are associates, such 
that a = up|'--: py‘ and b = vp}'--- py‘, for some units u,v € R and some 
nonnegative integers m; and n;. Furthermore, a\b if and only if m; <n; for all i. 


Proof. Write each of a and b as a product of irreducibles. List all of the irreducibles 
that appear, and if some are associates, say qg), q2,..., then delete all but one. Let 
P1,---s Px be the irreducibles that remain. Then a can be written as a product of 
irreducibles, each of which is an associate of some p;, and so can be written as a 
product of a unit and p;. Gathering the units together, we obtain our expression for 
a, and similarly for b. 

If m; < n; for all i, then we see that b = avu7! pj!" --- pp’ *; hence, alb. 
Conversely, without loss of generality, suppose that m,; > n,. If a|b, then write 
b = aac. As R is an integral domain, we can use cancellation, and obtain 


m,—n, m2 me, .—-1 n2 Nk 
Pi Po -*** Py C =U VPo +: PE - 


Here, c is either a unit or a product of irreducibles. By unique factorization, p; must 
be an associate of one of u—'vpo, P3,--+, Px. But by our choice of the p;, this is 
impossible. 


A UFD does not necessarily have anything comparable to a Euclidean function, so 
we cannot order elements in any logical way. However, we can obtain the equivalent 
form of a gcd given in Theorem 10.7. 


Theorem 10.14. Let R be a UFD. Take any nonzero nonunits a, b € R, and write 
them in the form a = up}''--- py", b = vp{'+-- p;‘, as in Theorem 10.13. Let 
d= p .: . pi, where 1; is the smaller of m; and n;, for alli. Then d\a, d\b, and if 


cla and c\|b, then cld. 
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Proof. Theorem 10.13 tells us that d|a and d|b. Suppose that cla and c|b. If c isa 
unit, then surely c|d. Suppose it is not. Then write a = cr, withr € R. Now c can 
be written as a product of irreducibles, and r is a unit or a product of irreducibles. 
By unique factorization, all of these irreducibles must be associates of the p;. Using 
Theorem 10.13 again, we can write c = wp ee Ds where w is aunit and jx, < mx, 
for all k. By the same argument, as c|b, we have j, < nz, and hence jx < Jx, for all 
k. Therefore, cld. 


Exercises 
10.31. Show that 1 +i is prime in the ring R of Gaussian integers. 


10.32. In the ring of Gaussian integers, which of the numbers 3, 5 and 7 are irre- 
ducible? 


10.33. Must a unital subring of a UFD be a UFD? Prove that it must, or give an 
explicit counterexample. 


10.34. Let R be a UFD. Suppose that a and b are nonzero nonunit elements of R. If 
d, and dy are gcds of a and b (in the sense discussed in the second part of Theorem 
10.7), show that d, and d> are associates. 


10.35. Let R = {a+bV6i : a,b € Z}. Finda, b,c, d € Rsuchthat 10 = ab = cd, 
but a, b, c and d are all irreducible and neither of {a, b} is an associate of either of 
{c, d}. Conclude that R is not a UFD. 


10.36. Let R be a UFD, and let p be an irreducible element of R. If a and b are 
nonzero nonunits of R, and p|ab, writing both a and b as products of irreducibles, 
show that p is an associate of at least one of the irreducibles appearing in at least 
one of these products. 


10.37. Show that every irreducible in a UFD is prime. 


10.38. Let R be a UFD. Suppose that there exist a,, a2,... € R such that (a,) C 
(a2) C ---. Show that there exists an i such that (a;) = (aj+1). 
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Part IV 
Fields and Polynomials 


Chapter 11 ®) 
Irreducible Polynomials cro 


Let F'[x] be the polynomial ring over a field F. If f(x) € F[x], we can now discuss 
some conditions under which f (x) is irreducible. 


11.1 Irreducibility and Roots 


For any field F, we recall that the polynomial ring F'[x] is a UFD (see Example 
10.17). Also, by Exercise 10.4, the units in F'[x] are the nonzero elements of F’. Thus, 
every polynomial of degree greater than 0 is a product of one or more irreducibles. 
Here, a polynomial f(x) of degree greater than 0 is irreducible over F if, whenever 
f() = g(x)h(x) for some g(x), h(x) € F[x], either g(x) or h(x) is an element of 
F. Otherwise, f(x) is reducible. Note that irreducibility depends very much upon 
the particular field. 


Example 11.1, The polynomial x? — 2 is irreducible over Q, but reducible over R, 


since x2 —2 = (x — /2)(x + V2). 


Let f(x) = a9 + aix +--+ +anx" € F [x]. Ifr € F, we can evaluate f(x) at r, 
and obtain 
f(r) =ayp + aur +agr? +--+ + a_r”. 


In this way, we obtain a function (not a homomorphism!) a : F — F given by 
a(r) = f(r). In dealing with polynomials in R[x], we are accustomed to identifying 
the polynomial f(x) with this function a. But over a more general field, we cannot 
do this. Indeed, two different polynomials can induce the same function. 


Example 11.2. In Zs[x], the polynomials f(x) = x3+x+1 and g(x) = wP+4+x3 41 
induce the same function. That is, f(r) = g(r) for all r € Zs. (There are only five 
elements in Zs, so this is easily checked.) 


It is worth mentioning that we do obtain a homomorphism if we fix an element r 
of the field and consider evaluating polynomials at r. 
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Lemma 11.1. Let R be a commutative ring and fix r € R. Then the function a : 
R[x] > R given by a(f (x)) = f(r) is a homomorphism. 


Proof. Let f(x) = do +--+ + ay)x” and g(x) = bo +--+ + dyx” be arbitrary 
polynomials in R[x] (adding in terms with coefficient zero if necessary). Then 


a( f(x) + g(x) =a9 + bo tar +bhyr+-+-+ayr" + br" 
= (do +--+ +anr") + (bo +--+ + bar") 
= a(f(x)) + a(g(x)). 


Also, writing f(x)g(x) =co+t---+ Con x2", where c; = dob; +----+a;bo, we have 
a(f(x)g(x)) = co +++ + Cm", 


whereas 
a( f (x))o(g(x)) = (ao +--+ + anr")(bo +--+ + bar”). 


But for any i, 
ai 


agbjr' + ayrbj_yr’ +---+tar'bo =cir', 


and so a(f (x)g(x)) = a(f (x))a(g(x)). 


We can now use the division algorithm to write a polynomial over a field F as a 
multiple of x — a, for any a € F, plus a constant. 


Theorem 11.1 (Remainder Theorem). Let F be a field and f(x) € F[x]. Take 
any a € F. Then there exists a q(x) € F[x] such that 


f(x) = (« — ayq(x) + f(a). 
Proof. By the division algorithm for polynomials, f(x) = (« — a)q(x) + r(x), 


where g(x), r(x) € F[x], and either r(x) is the zero polynomial, or deg(r(x)) < 
deg(x — a) = 1. That is, r(x) is some constant, b € F. By the preceding lemma, 


f(a) = (a—a)q(a)+b=b. 


It is crucial for us to know if a polynomial has any roots. 


Definition 11.1. Let F be a field and f(x) € F[x]. Ifa € F, then we say that a is 
aroot of f(x) if f(a) = 0. 


Example 11.3. The polynomial x” — 2 has no roots in Q. However, if we regard it 
as a polynomial over R, we see that /2 and —/2 are roots. 
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Recall that if f(x), g(x) € F[x], we say that f(x) divides g(x), and write 
Ff ()|g(x), if there exists an h(x) € F[x] such that g(x) = f(x)h(x). 


Theorem 11.2 (Factor Theorem). Let F be a field and f(x) € F[x]. Take any 
a € F. Thena is a root of f (x) if and only if (x — a)|f (x). 


Proof Suppose that a is a root of f(x). By the Remainder Theorem, we have f(x) = 
(x — a)q(x), and hence (x — a)| f(x). Conversely, suppose that (x — a)| f(x). Then 
f(x) = (x —a)g(x), for some g(x) € F[x]. In this case, f(a) = (a — a)g(a) = 0, 
and hence a is a root. 


Example 11.4. InZ [x], let f(x) = 3x3+5x?+4x+4. We note that 2 is a root. Thus, 
x — 2 (in other words, x +5) must divide f (x). In fact, f(x) = (x —2) (3x? +4x+5). 


Corollary 11.1. Let F be afield and f(x) € F[x]. If deg(f(x)) > 1 and f (x) has 
a root in F, then f (x) is reducible over F. 


Proof. Let a be a root of f(x). By the Factor Theorem, f(x) = (x — a)g(x), for 
some g(x) € F[x]. Since deg(f(x)) > 1, we note that g(x) is not a constant. Thus, 
Ff (x) is reducible. 


The converse is false! 


Example 11.5. In R[x], let f(x) = x++2x?+ 1. For anya € R, we have f(a) > 1; 
thus, f(x) has no real roots. However, f(x) = (x? + 1)”. Thus, Ff (x) is reducible. 


However, for polynomials of degree 2 and 3, the converse does hold. 


Corollary 11.2. Let F be a field and f (x) € F[x]. Then 


I. ifdeg(f(x)) = 1, then f (x) is irreducible over F ; and 
2. if f (x) has degree 2 or 3, then f (x) is irreducible over F if and only if it has no 
roots in F. 


Proof. (1) If f(x) = g(x)h(x), then by Theorem 10.2, either g(x) or h(x) has 
degree 0. 

(2) If f(x) is irreducible, then the preceding corollary tells us that f(x) has no 
roots. Suppose that f(x) is reducible, say f(x) = g(x)h(x) for some nonconstant 
polynomials g(x) and h(x) in F[x]. As the sum of their degrees is 2 or 3, either g(x) or 
h(x) must have degree 1. Without loss of generality, say g(x) = ax+b, witha, b € F 
and a # 0. But then notice that f(—a~'b) = (a(—a~'b) + b)h(—a7'b) = 0. Thus, 
—a™'b is aroot of f(x). 


We can also put a limit on the number of roots of a polynomial. 


Corollary 11.3. Let F be a field and f(x) € F[x] a nonzero polynomial. If f (x) 
has degree n, then f (x) has at most n roots in F. 
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Proof. We proceed by induction on n. If n = 0, then f(x) is a nonzero constant 
polynomial, which clearly has no roots. Assume that our result is true for n, and let 
deg(f(x)) = n+ 1. If f(x) has no roots, then we are done. Otherwise, let a be a 
root. By Theorem 11.2, f(x) = (x — a)g(x), for some g(x) € F[x]. Furthermore, 
by Theorem 10.2, deg(g(x)) = n. Thus, our inductive hypothesis tells us that g(x) 
has at most 7 roots. Let b be any root of f(x). Then 0 = f(b) = (b — a)g(b). 
Therefore, either b — a = 0 (and b = a) or g(b) = 0 (and D is among the at most n 
roots of g(x)). Thus, f(x) has at most n + 1 roots, as required. 


Exercises 
11.1. Are the following polynomials irreducible in Z7[x]? 


1. x? +5x7+4x+3 

2. 6 x el 

3. x4+x742 

11.2. Write each of the following as products of irreducibles in Zs[x]. 


1. x3 +3x24+3x +2 
2. x3 4+2x2? +4742 
3. x4 42x03 +47 43 


11.3. Find every irreducible polynomial of degree 3 over Zz. 


11.4. If we divide 3x*? + 4x!° + 2 by x +5 in Z;[x], what is the remainder? (The 
answer must be in {0, 1, ..., 6}.) 


11.5. Let F be an infinite field. If f(x), g(x) € F[x], and f(a) = g(a) for all 
a € F, show that f(x) = g(x). 


11.6. Let p bea prime. Find infinitely many polynomials f\ (x), fo(x), ...in Zp[x] 
such that f;(a) = 0 for all a € Z, and all positive integers i. 


11.7. Is Lemma 11.1 still true for noncommutative rings? 


11.8. Let R be an integral domain. Show that U(R) has at most n elements of order 
n, for every positive integer n. Also give an example of a commutative ring R with 
identity which is not an integral domain for which this is not true. 


11.9. Let p be a prime number. Show that the following are equivalent: 


1. x? + 1 is reducible in Zp[x]; and 
2. there exist nonnegative integers m and n such that p = m+n and p|(mn — 1). 


11.10. Show that Theorems 11.1 and 11.2 remain true if F is replaced with an 
integral domain. 
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11.2 Irreducibility over the Rationals 


If we have a polynomial f(x) € Q[x], then by multiplying by a suitable positive 
integer, we obtain a polynomial in Z[x]. It is often simpler to start with a polynomial 
with integer coefficients. 

As we noted in the preceding section, a polynomial of degree greater than | in Q[x] 
is necessarily reducible if it has a root. Of course, there are infinitely many possible 
roots, so testing them all is impossible. However, we can narrow the possible roots 
down to a finite set of rational numbers. 


Theorem 11.3 (Rational Roots Theorem). Let f(x) = dj + a;x +--+: + 
a,x" € Z[x], with a, # 0. Suppose that q € Q is a root of f(x). Ifq = 4, 
with r,s € Zand (r,s) = 1, thenr|dag and s|dpy. 


Proof. We have 


r ar 
0= (=) =a +S +--+ + 
Ss S 
Multiplying through by s”, we obtain 
aos" + ars” | +--+ +a, yr" |s +a,r” = 0. 
As s divides every term except a,r”, it also divides a,r”. Since (7, s) = 1, Corollary 


2.2 tells us that s|a,. Similarly, r divides every term except aos”, so it also divides 
aos”. Since (r, s) = 1, we see that r|ao. 


Example 11.6. Let f(x) = 3x? + 2x? — 2x — 8. In view of the Rational Roots 
Theorem, the only possible rational roots of f(x) are +1, +2, +4, +8, ! Z, +4 
and +8. Trying them all, we see that the only rational root of f(x) is <. 


3? 


Of course, polynomials can be reducible without having roots. If we wish to 
restrict our attention to polynomials in Z[x], we must be sure that it makes sense to 
do so. At first blush, it seems conceivable that we could have a polynomial in Z[x] 
that factors into a product of polynomials of lower degree in Q[x], but not in Z[x]. 
In fact, this does not happen. Let us see why. 


Definition 11.2. If f(x) is a nonzero polynomial in Z[x], then the content of f(x) 
is the largest positive integer that divides every coefficient of f(x). We say that f(x) 
is primitive if its content is 1. 


Example 11.7. The polynomial 6x? — 15x? + 81x — 12 has content 3, whereas 
5x* + 14x — 2 is primitive. 


We can now present a famous result due to Carl F. Gauss. 


Lemma 11.2 (Gauss’s Lemma). The product of two primitive polynomials in Z[x] 
is also primitive. 
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Proof. Let f(x) = do +--+ + a,x" and g(x) = bo +--- + byx™ be primitive. 
Suppose that f(x)g(x) is not primitive. Let p be a prime dividing the content of 
f(x)g(x). As p cannot divide all of the coefficients of f(x), let i be the smallest 
nonnegative integer such that p does not divide a;. Similarly, let 7 be the smallest 
nonnegative integer such that b; is not divisible by p. Then the coefficient of x!*/ in 


f(x)g(x) is 
agbi4j + abi4 jar +++ + aj-1bj41 + adj + ajy1Dj-1 +++ +aj4j-1b1 + a4 jbo, 


where we add terms with coefficient zero if necessary. Now, this coefficient must be 
divisible by p. Also, p divides ax, 0 < k < i, and p divides b;,0 <1 < j. Thus, 
every term in the sum is divisible by p except ajb;, which means that p|a;b; as well. 
But this contradicts Theorem 2.7. 


As a consequence, we can see that if a polynomial in Z[x] is reducible in Q[x], 
then it is reducible in Z[x] as well. 


Theorem 11.4. Let f(x) be a polynomial in Z[x], and suppose that f(x) = 
g(x)h(x), where g(x), h(x) € Q[x]. Then there is a positive rational number q 
such that qg(x) and qx) lie in Z[x]. 


Proof. Assume, first of all, that f(x) is primitive. Choose positive integers a and b 
such that ag(x), bh(x) € Z[x]. Then abf (x) = (ag(x))(bh(x)). 

Let c be the content of ag(x) and d the content of bh(x). Then “8(X), Ph(x) E 
Z[x], and both are primitive polynomials. By Gauss’s lemma, their product, ae (x), 
is also primitive. Thus, the content of abf (x) is cd. But as f(x) is primitive, the 
content of abf (x) is also ab. Thus, ab = cd, and hence letting g = *, we see that 
b_ 1 


: Suppose that f(x) is not primitive. If it is the zero polynomial, then either g(x) or 
h(x) must be as well. Without loss of generality, say that h(x) is the zero polynomial. 
Then let g be a positive integer such that gg(x) € Z[x]. On the other hand, if f(x) 
is not the zero polynomial, then let k be its content. Writing f(x) = kfi(x), with 
fi) € Z[x], we have fi (x) = (¢g(x)) h(x). By the argument above, there exists a 
positive rational number g such that 4 g(x), qx) € Z[x]. But then gg(x), he) € 
Z[x] as well. 


Example 11.8. The polynomial f(x) = 3x? + 2x? — 2x — 8 has 4 as a rational 
root. Thus, by Theorem 11.2, g(x) = x — $ is a divisor of f(x) in Q[x]. Performing 
polynomial long division, we see that f(x) = g(x)h(x), where h(x) = 3x7 +6x +6. 
Using q = 3 in the above theorem, we find that f(x) = (3x — 4) (x? + 2x +2), and 
we have a factorization in Z[x]. 


Even if a polynomial has coefficients in Z, it can still be difficult to tell if it is irre- 
ducible over Q. One nice result that can be rather helpful is attributed to F. Gotthold 
M. Eisenstein, although a proof was first published by Theodor Sch6nemann. 
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Theorem 11.5 (Eisenstein’s Criterion). Let f(x) = ag +a, +---+a,x" € Z[x], 
withn > landa, 4 0. Suppose that there exists a prime p such that p\a;,0 <i <n, 
but p { a, and p* + a. Then f (x) is irreducible in Q{x]. 


Proof. If f (x) is reducible, then by Theorem 11.4, there exist nonconstant polyno- 
mials g(x) = bo+-- -+ bx! and h(x) = cot: ++ +x” in Z[x], with b; AOA CH 
and f(x) = g(x)h(x). Now, p divides ag = boco, but pp does not. Thus, p divides 
exactly one of {bo, co}. Without loss of generality, say p|bo. But p does not divide 
An = biCm. Thus, p divides neither b; nor c,,. Let i be the smallest positive integer 
such that p { b;. Then 


dj = boc, + bici-, +--+ + Bj-1¢1 + Dico. 


Now, p|b;,0 < j <i. Furthermore, asi </ <n, we know that p|a;. Thus, p|bjco. 
But p divides neither b; nor co, and we have a contradiction. 


Example 11.9. The polynomial 13x? — 42x? + 81x — 15 is irreducible over Q, using 
Eisenstein’s criterion with p = 3. 


Example 11.10. For any positive integer n and any prime p, we observe that x” — p 
is irreducible over Q. 


Note that if F is a subfield of K, and f(x) is a reducible polynomial in F[x], then 
it is also necessarily reducible in K [x] (just using the same factorization). Of course, 
the fact that it is reducible in K[x] does not imply that it is reducible in F[x], as we 
illustrated in Example 11.3. 

But the relationship between Z[x] and Q[x] is backwards. Indeed, we have seen 
that if a polynomial in Z[x] is reducible in Q[x], then it is also reducible in Z[x]. 
The other direction does not work! 


Example 11.11. Let f(x) = 2x — 6. Then by Corollary 11.2, f(x) is irreducible 
in Q[x]. But f(x) is reducible in Z[x]; indeed, f(x) = 2(x — 3), and neither 2 nor 
x — 31s aunit in Z[x]. 


The problem, then, is that the nonzero constants are not necessarily units in Z[x], 
and this affects irreducibility. 


Lemma 11.3. Let f(x) € Z[x]. Then f (x) is irreducible in Z[x] if and only if either 


I. f (x) is a (positive or negative) prime in Z; or 
2. f(x) is a primitive polynomial that is irreducible in Q[x]. 


Proof. Note that a unit in Z[x] is also a unit in Q[x], and hence a constant. But the 
only constants having inverses in Z[x] are +1, so those are the only units. 

Suppose that f(x) is aconstant c € Z. Ifc is prime, then its only factorizations are 
1-c and (—1)(—c), so f (x) is irreducible. Otherwise, c has some other factorization, 
and f(x) is not irreducible. 
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So, let deg(f(x)) => 1. Suppose that f(x) is irreducible in Z[x]. If f(x) has 
content d > 1, then f(x) has a factorization d (4 ve (x)), and therefore f(x) is 
reducible. So, we may assume that f(x) is primitive. If it is reducible in Q[x], then 
by Theorem 11.4, it is reducible in Z[x] as well. Conversely, assume that f(x) is 
irreducible in Q[x] and primitive. If f(x) = g(x)h(x), with g(x), h(x) € Z[x], then 
we have a factorization in Q[x] as well, which would make f(x) reducible over Q, 
unless either g(x) or h(x) is aconstant. Without loss of generality, let g(x) = e £ 0. 
If e = +1, then g(x) is a unit in Z[x]. If not, then f(x) has content |e| times the 
content of h(x), contradicting the assumption that f(x) is primitive. Thus, f(x) is 
irreducible in Z[x] in this case. 


Let us now present the counterexample promised in Section 10.4. We already 
know that Z[x] is not a PID. But we have the following. 


Theorem 11.6. The ring Z[x] is a UFD. 


Proof. Let f(x) € Z[x] be a nonzero nonunit. We will show that f(x) is a product 
of irreducibles. First, suppose that deg(f(x)) = n > 1. We claim that f(x) is a 
product of polynomials in Z[x] that are irreducible in Q[x]. Our proof is by strong 
induction on n. Ifn = 1, then f(x) is irreducible in Q[x] and there is nothing to do. 
Let n > 2, and suppose that our claim holds for polynomials of smaller degree. If 
f (x) is irreducible in Q[x], then again, there is nothing to do. Otherwise, we know 
that f(x) = g(x)h(x), where g(x) and h(x) are polynomials of degree less than n 
in Q[x]. By Theorem 11.4, we may choose g(x) and h(x) to be in Z[x]. Then our 
inductive hypothesis tells us that g(x) and h(x) are products of polynomials in Z[x] 
that are irreducible in Q[x], and hence, so is f(x), proving the claim. 

If f(x) = fi(x)--- f;(x), where each f; (x) is irreducible in Q[x], then let c; be 
the content of f; (x). We now have 


1 1 
f(x) = (C1-+ + cx) (= A009) oe (£09) ; 


where each + f(x) is irreducible in Z[x], by the preceding lemma. 

Thus, bringing the deg( f (x)) = 0 possibility back into consideration, we see that 
J (x) is either an integer not in {0, +1}, or a nonzero integer multiplied by a product 
of irreducibles in Z[x]. 

It remains only to consider the case of an integer. But the Fundamental Theorem 
of Arithmetic tells us that any integer not in {0, +1} is a product of (positive or 
negative) primes, which are certainly irreducible in Z[x]. We do still have to deal 
with f(x) = (—1)gi(x) --- gx(x), where each g;(x) is irreducible, but then this is 
(—g1(x))go(x)--- gx(x), and —g; (x) is irreducible as well. 

Let us verify the uniqueness. Suppose that 


FO) = Pi->+ PeBi(®) +++ BX) = 41+ ++ dmb) +++ hn), 
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where each p; and gq; is a (positive or negative) prime in Z, and each g; (x) and h; (x) 
is a primitive polynomial which is irreducible in Q[x]. (We allow the possibility that 
k,l, m orn may be zero.) By Gauss’s lemma, the product of primitive polynomials is 
primitive. Thus, the content of f(x) is |pi--- pe| = |q1--- ql. By the Fundamental 
Theorem of Arithmetic, k = / and after rearranging, each p; = -tq;. Cancelling, 
we have g1(X) +--+ 8m(x) = £h)(x)---h,(x). But these are products of irreducible 
polynomials in Q[x]. As Q[x] is a UFD, m = n and, after rearranging, each g;(x) = 
gih;(x), for some q; € Q. Write g; = a with r;,s; € Zand s; 4 0. Then s;g;(x) = 
rjhj(x). As g;(x) and h;(x) are primitive, looking at the content of each side of the 
equation, we have |s;| = |r;|, and hence g; € {1, —1}. We are done. 


Exercises 


11.11. Find all rational roots of each of the following polynomials. 


1. x3 — 7x? + 5x +2 
2. 6x4 —x3 + 4x2 —x —2 


11.12. Are the following polynomials irreducible over Q? 


1. 3x4 + 15x3 — 25x? + 45x + 10 
2. 2x3 +5x7+x+7 
3. x!4 75 


11.13. Write each of the following polynomials as a product of irreducibles in Q[x]. 
1. x* — 10x3 + 35x? — 48x + 18 

2. x442x3 +224 3x42 

11.14. Write each of the following polynomials as a product of irreducibles in Z[x]. 
1. 6x4 + 84x? — 126x 

2. 6x* — 3x3 + 18x? — 3x —3 


11.15. Let F be a field, a € F and f(x) € F[x]. Show that f(x) is irreducible if 
and only if f(x + a) is irreducible. 


11.16. Modify Eisenstein’s criterion as follows, namely, insist that p|aj,1 <i <n, 
but p { ao and p* { ay. Show that the result still holds. 


11.17. Is 7x® + 21x° — 49x3 + 14x? + 7x + 2 reducible or irreducible over Q? 


11.18. Let R be a Euclidean domain. If f(x) € R[x] is a nonzero polynomial, let 
us say that it is primitive if the only common divisors of its coefficients are the units 
of R. Show that Gauss’s lemma holds in R[x]. 
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11.3 Irreducibility over the Real and Complex Numbers 


While the real numbers may seem like a more natural field with which to work, 
the complex numbers have a more attractive algebraic structure. Indeed, we wish 
to consider the complex numbers, because there are nonconstant real polynomials, 
such as x* + 1, having no real roots. The complex numbers do not have this problem. 
Indeed, this is a famous result known as the Fundamental Theorem of Algebra. 
There are many different proofs of this theorem. Curiously, to the best of the author’s 
knowledge, all of these proofs require results from outside of algebra. A proof that 
is mostly algebraic can be found in the advanced textbook of Dummit and Foote [1]. 
(Sadly, the algebra involved is somewhat beyond the scope of this course.) 


Theorem 11.7 (Fundamental Theorem of Algebra). Let f(x) be a nonconstant 
polynomial in C[x]. Then f (x) has a root in C. 


We say that the field of complex numbers is algebraically closed. 


Corollary 11.4. If f(x) € C[x], then f(x) is irreducible if and only if 
deg( f(x) = 1. 


Proof. Combine Theorem 11.7 with Corollaries 11.1 and 11.2. 


Corollary 11.5. Let f(x) € C[x] be a nonconstant polynomial. Then there exist 
a, C1, C2,-+-; Cy, € C such that f (x) = a(x — cy) (% — €2) +++ (X — Cp). 


Proof. We proceed by induction on deg(f (x)) = n.Ifn = 1, then f(x) =ax+b= 
a(x — (—a~'b)), for some a, b € C, witha 4 (0). Suppose that the result is true for 
n, and let deg(f(x)) = n+ 1. By Theorem 11.7, f(x) has a root, c,+; € C. But 
then Theorem 11.2 tells us that f(x) = g(x)(« — cCy41), where deg(g(x)) = n, by 
Theorem 10.2. Now apply our inductive hypothesis to g(x). 


Thus, complex polynomials behave as nicely as we could possibly wish. What 
about real polynomials? The situation there is slightly more complicated. 


Lemma 11.4. Let f(x) € R[x]. Ifc,d € R, and c + di is a complex root of f (x), 
then so is c — di. 


Proof. Write c+ di = c — di. Let f(x) = ag +--+: + a,x", a; € R. Then if 
z=c+dli, we have 


f @) = aq + az + an (2)? +--+ + ay (Z)”. 


But by Example 9.13, the function mapping z to z is a homomorphism. Thus, 
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f@ =a tayZ + agz2 +--+ +ayz" 
= Gp + HZ + He +--+ + H,2" 
= do + ayZ + apz7 +++ + nz” 
=f/@=0=0, 


making using of the fact that each aj = a;, since a; € R. 


We can use this to classify the irreducible real polynomials. 


Theorem 11.8. Let f(x) € R[x]. Then f(x) is irreducible over R if and only if 
either 


I. deg(f(x)) = 1; or 
2. fxa= ax? + bx +c, where a 4 0 and b? < 4ac. 


Proof. Since R is a field, constant polynomials are either 0 or a unit, and therefore 
need not be considered. If deg( f(x)) = 1, then Corollary 11.2 tells us that f(x) is 
indeed irreducible. Therefore, let f(x) have degree at least 2. Suppose that f(x) is 
irreducible. 

By Theorem 11.7, f(x) hasarootz = a+bi € C.Ifz € R, then by Corollary 11.1, 
we have a contradiction. Assume otherwise. By Lemma 11.4, a — bi is also a root. 
Expressing f(x) as in Corollary 11.5, we see that (x — (a+ bi))(x — (a—bi))| f(x) 
in C[x]. But (x — (a + bi))(x — (a — bi)) = x? — 2ax + (a* + b*) € R[x]. Thus, 
applying the division algorithm, we see that there exist g(x), r(x) € R[x] such that 
f(@) = (x? — 2ax + (a? + b*))q(x) + r(x), and r(x) = 0 or deg(r(x)) < 2. 
By the uniqueness of the division algorithm in C[x], we must have r(x) = 0 and 
x? — 2ax + (a? + b’) divides f(x) in R[x]. In particular, if deg(f(x)) > 2, then 
f(x) must be reducible. 

Thus, we may assume that f(x) = ax? + bx +c, witha 4 0. By Corollary 11.2, 
such a polynomial is irreducible over R if and only if it has no roots in R. But the 
quadratic formula tells us that this happens if and only if b? — 4ac < 0. We are done. 


We can use this to recover a well-known fact from calculus. 


Corollary 11.6. Let f(x) € R[x] be a polynomial of odd degree. Then f (x) has a 
real root. 


Proof. We know that R[x] is a UFD. Thus, write f(x) as a product of irreducible 
polynomials. By the preceding theorem, each such irreducible has degree 1 or 2. 
Since f(x) has odd degree, at least one of these irreducible polynomials has degree 
1. Therefore, there exist a, b € R, with a ¥ 0, such that ax + b divides f(x). But 
then —a~'b is aroot of f(x). 
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Exercises 


11.19. Given that a is a root of f(x), find all complex roots of f(x). 


lL. f(x) =x? — 11x? 4+ 41x —-91,a = 243i 
2: fe) =xt4+2x?-2x4+6,a=1-i 


11.20. Given that a is a root of f(x), find all complex roots of f(x). 


1. f(x) =x? +x? —5x —21,a=3 
2: f(x) = x4 — 6x? + 33x? — 84x + 136,a = 144i 


11.21. Write each of the following polynomials as a product of irreducibles in Q[x], 
R[x] and C[x]. 


1. x*+— 10 
2. x3 +2x24+5x —22 


11.22. Write each of the following polynomials as a product of irreducibles in Q[x], 
R[x] and C[x]. 


1. x? +12 
2. x4 + 4x? +4 


11.23. Find a nonzero polynomial in R[x] having 2 — 5i, 4+ 7 and 6 as roots. 


11.24. Let f(x) and g(x) be nonzero polynomials in Q[x]. Consider the geds of 
Ff (x) and g(x) in Q[x], R[x] and C[x]. Must these gcds be the same, or can they be 
different? 


11.4 Irreducibility over Finite Fields 


When our field is finite, we have the luxury of taking a brute force approach to 
factoring polynomials. That is, we can simply list all of the polynomials of suitable 
degrees, and see if the products work. Of course, we can save ourselves some effort 
by narrowing the possibilities first. 


Example 11.12. Let f(x) = xt 4x3 +x? +x +41 © Zo[x]. We claim that Ff (x) is 
irreducible over Zp. If not, there are two possibilities. First, f (x) could be a product of 
a degree | polynomial and a degree 3 polynomial. But if it has a degree 1 polynomial 
as a factor, then it has a root. There are only two possible roots in Z2, namely, 0 
and 1, and neither works. Second, f(x) could be a product of two polynomials of 
degree 2. Now, the only possible coefficients are 0 and 1. Furthermore, the leading 
coefficients and the constant terms must multiply to give 1. Thus, the only possible 
factors are x7 + 1 and x? +x +1. But x? + 1 has 1 as a root, and Ff (x) does not, so 
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only x? + x + 1 remains. However, (x? +x +1)? = x*4+x74+1 # f(x). Thus, 
Ff (x) is indeed irreducible. 


Example 11.13. Let f(x) = 3x° + x* + 4x3 4+ 4x? + 3x +2 € Zs[x]. We would 
like to write f(x) as a product of irreducibles. The first thing we should do is check 
for roots. We run through the five elements of Zs; and find that 3 is a root. Thus, 
x — 3 (or, equivalently, x + 2) divides f(x). Performing polynomial long division, 
we find that f(x) = (x + 2)(x4 + 4x? + x + 1). Let g(x) = 3x4 + 4x7 4+ x41. 
Evaluating g(x) at each element of Zs, we see that g(x) has no roots. Thus, if it 
is to be factored, it must be as a product of two polynomials of degree 2. Up to a 
unit in Zs, these factors would have to be x” + ax + b and 3x? + cx +d, for some 
a,b,c,d € Zs. Furthermore, looking at the constant terms, we have bd = 1. Thus, 
once b is decided, d = b7!. Looking at the coefficients of x3, we have 3a +c = 0. 
Thus, once a is decided, we have c = 2a. Trying the various possibilities for a and 
b, we have g(x) = (x? + 2x +3) (3x? +4x +2). Since g(x) has no roots, this cannot 
be factored any further. Thus, f(x) = (x + 2)(x? + 2x + 3)(3x? + 4x + 2) isa 
product of irreducibles in Zs[x]. 


Our ability to handle polynomials over finite fields can be helpful when we con- 
sider polynomials in Q[x]. 


Theorem 11.9. Let f(x) = ay + a,x +--+ +a,x" € Z[x]. Let p be a prime such 
that p { ay. Reducing all of the coefficients modulo p, if [ag] + [a.]x +-+- + [an ]x” 
is irreducible in Z,[x], then f (x) is irreducible in Q{x]. 


Proof. Suppose f(x) is reducible in Q[x]. Then by Theorem 11.4, we must have 


F(x) = g(x)h(x), where g(x) = bp +--- + byx* and h(x) = co #- ++ + mx” are 
polynomials in Z[x], with k,m > 0 and bk 4 0 # cy. Now, we have 


a; = boc; + bic;_-1 + +++ + bico, 


for each i. By Example 9.12, the function from Z to Z, sending d to [d] is a ring 
homomorphism. Thus, 


lai] = [bollei] + [bi]lci-1] + --- + [Billo]. 
It now follows that 
[ao] + +++ + [an]x” = (bol + +++ + [belx") Leo] + +++ + Eem]x””). 
That is, [dg] + --- + [a,]x” is reducible, unless one of the factors is a constant 
polynomial. But as p { ay, we see that the degree of [a9] +---+[a,]x" isn =k-+m. 


The only way the product will have the correct degree is if [bg] 4 [0] 4 [cm]. This 
contradiction completes the proof. 
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Note that the condition that p does not divide the leading coefficient is important. 
Indeed, 3x” + x — 4 is reducible in Q[x], as it is (x — 1)(3x + 4). But if we tried to 
use p = 3, we would obtain x + 2 € Z3[x], which is certainly irreducible. Also, the 
converse of the theorem is not true. For instance, x” + 1 is irreducible over Q (or, 
for that matter, R), but in Zs[x], we have x7 + 1 = (x + 2)(x + 3). 


Example 11.14. We claim that 15x*+ — 29x? + 13x? + 33x — 201 is irreducible over 
Q[x]. Use Theorem 11.9 with p = 2. In Z,[x], we obtain x4 + x7 +47 +x+41 
which, by Example 11.12, is irreducible. 


Sometimes, we might have to try more than one prime. 


Example 11.15. Let f(x) = 5x? + 3x7 +x +1. If we use p = 2, we obtain 
x? +x? +x +1 € Z,[x]. But this polynomial has 1 as a root, so it is reducible. No 
help here! Let us try p = 3. Then we get 2x3 + x + 1 € Zs[x]. By Corollary 11.2, 
it is irreducible if it has no roots. But trying 0, | and 2, we see that it has no roots in 
Z3. Thus, f (x) is irreducible in Q[x]. 


Exercises 


11.25. Are the following polynomials reducible or irreducible over the rationals? 


1. f(x) =x? 4+ 5x? +2x+ 16 
2. f(x) = 22x4 — 9x3 + 16x? + 18x + 20 


11.26. Are the following polynomials reducible or irreducible over the rationals? 
1. f(x) = 9x4 — 15x37 + 8x? — 6x + 25 
2. f(x) = 2x4 + 11x? + 16x? + 5x +6 


11.27. Let F be a finite field with n elements. How many monic irreducible poly- 
nomials of degree 2 are there in F[x]? 


11.28. Write each of the following polynomials as a product of irreducibles in 
Zi [x]. 


1, 2x3 + 3x? + 9x +10 
2. x4 44x34 5x27 4x47 


11.29. Let p be an odd prime. Show that x* + 1 is reducible over Z, in each of the 
following cases: 


1. there exists ana € Z, such that e= p-l 
2. there exists ana € Z, such that a? = p—2;0r 
3. there exists ana € Z, such that a2 =2. 
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11.30. Show that x* + 1 is irreducible in Q[x] but reducible in Z p|x] for every 
prime p. (Thus, the converse of Theorem 11.9 is wildly false!) 
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Chapter 12 Mm) 
Vector Spaces and Field Extensions cro 


We begin this chapter with some basic facts about vector spaces. These will be 
familiar (at least in the case of real vector spaces) to those readers who have studied 
linear algebra. We then focus our attention on the particular case of a field extension. 
A number of properties of field extensions are discussed. 

Let F be a field and f(x) € F[x] a nonconstant polynomial. We demonstrate 
how to create a field extension in which f(x) splits into a product of polynomials of 
degree 1. This leads to a classification of all finite fields. 


12.1 Vector Spaces 


We begin with the definition of a vector space. In most linear algebra courses, vector 
spaces are defined over R or, occasionally, C. But we can do the same thing over any 
field. 

If F is a field and V isa set, then a scalar multiplication on V is a function from 
FxVtoV.Ifae F,ve V, then we write av for the image of (a, v) under such a 
function. 


Definition 12.1. Let F be a field. Then a vector space over F is a set V having an 
addition operation and a scalar multiplication such that 


1. V is an abelian group under addition; 

2. av € V foralla € F andallve V; 

3. (a+b)v=av+byv foralla,b € F andallv € V; 

4. au+v)=au-+ay foralla € F andallu,v eV; 

5. a(bv) = (ab)v for all a, b € F and all v € V; and 

6. lv=vforallve V. 
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Of course, condition (2) is redundant, given the definition of a scalar multiplica- 
tion, but we include it, because it must be checked. 
Certainly the most familiar vector space over R is R”. We can generalize this. 


Example 12.1. Let F bea field. For any positive integer n, let F"= FOF OQ--- OF. 
—<——$., —__<_<_—J4 
n times 
Then F” is a vector over F with the usual addition operation and scalar multiplication 
a(by,..., bn) = (abi, ..., ab,), for any a, bj,...,b, € F. 


Example 12.2. Let F be any field. Then F'[x] is a vector space over F with the usual 
polynomial addition and a(bp + bix +--+ + b,x") = abo + abjx +--+ + ab,x", 
for any a, bo, ...,b, € F. 


Example 12.3. Let m and n be any positive integers, and let V be the set of m x n 
matrices with entries in a field F. Then V is a vector space over F using matrix 
addition and scalar multiplication. 


The least exciting example of a vector space is the following. 


Example 12.4. Let F be any field and V the trivial additive group, {0}. Then V is 
a vector space using the only available addition and scalar multiplication options, 
0+0=0Oanda0=0, forallae F. 


The most important example for our purposes is the following. 


Definition 12.2. If F and K are fields, with F a subfield of K, then we say that K 
is an extension field of F. 


Example 12.5. Any extension field K of F is a vector space over F,, using the 
addition operation in K and multiplication in K as the scalar multiplication. (All 
the properties are immediate, except that lv = v for all v € K. To be sure of that, 
we must know that the identity of F is the identity of K. But this follows from 
Theorem 8.12.) For example, R and C are vector spaces over Q. 


Let us mention a few basic properties of vector spaces. 


Theorem 12.1. Let V be a vector space over F. Then 


I, a0=Oforallaeé F; 
2. Ov=0 forall v € V; and 
3. (-lv =-vforallv € V. 


Proof. (1) Note that a0 = a(0+ 0) = a0 + a0. Adding —(a0) to both sides, we see 
that a0 = 0. 

(2) We have Ov = (0+ 0)v = Ov + Ov. Adding —Ov to both sides, we obtain the 
desired conclusion. 

(3) Observe that v + (-l)v = lv+ (-1l)v = (1 — 1)v = Ov = O. Thus, 
(-l)v = -v. 
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We do have to be a bit careful about which 0 we are using. For example, when 
we write Ov = 0 in the theorem above, the first 0 is in F and the second is in V. 


Definition 12.3. Let V be a vector space over a field F'. Then a subset W of V is 
said to be a subspace of V if it is a vector space over F using the same addition and 
scalar multiplication. 


Example 12.6. If F is a subfield of K, and K is a subfield of L, then L is a vector 
space over F having K and F as subspaces. 


Example 12.7. Regarding R[x] as a vector space over Q, we note that Q[x] is a 
subspace. 


There is a simple test for a subspace. 


Theorem 12.2. Let F be a field and V a vector space over F. Then a subset W of 
V is a subspace if and only if 


I. 0EW; 
2. Wi +W2 € W for all w,, w2 € W (closure under addition); and 
3. aw € W foralla € F andw € W (closure under scalar multiplication). 


Proof. If W is a subspace then, in particular, it is an additive subgroup, so (1) and 
(2) hold. Part (3) is one of the conditions for a vector space. Conversely, suppose that 
(1), (2) and (3) hold. Noting that (3) tells us that -w = (—1)w € W, forall w € W, 
we see from Theorem 3.10 that W is an additive subgroup of V. We are given closure 
under scalar multiplication. The remaining vector space properties hold in V, and 
therefore in any subset of V. 


Note that in the preceding theorem, condition (1) could be replaced with the 
condition that W is not the empty set, for if w € W, then —w e€ W, and therefore 
O0O=w+ (-w)e€ W. 

Example 12.8. Let V = R*, which is a vector space over R. We claim that W = 
{(a, b, 2a —b+ 3c, c): a, b,c € R} is a subspace of V. Letting a = b = c = 0, we 
see that (0, 0, 0,0) € W. To check closure under addition, take a;, b;, c; € R. Then 


(a1, by, 2a; — by + 3c], €1) + (ay, b2, 2a2 — bz + 3c2, €2) 
= (a) + a2, by + bz, 2(a, + az) — (db) + bz) + 3(cC1 + €2), C1 + €2) € W. 


Similarly, if a € R, then 
a(a,, bj, 2a, — by + 3c1, cy) = (aay, aby, 2aa, — ab; + 3ac,,ac,) € W. 


Thus, we have closure under scalar multiplication, and the claim is proved. 
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Exercises 


12.1. Let F be a field and n a positive integer. If V is the set of all polynomials of 
degree n in F [x], together with the zero polynomial, is V a subspace of F[x]? 


12.2. Let F be a field and n a positive integer. If V is the set of all polynomials 
of degree at most n in F'[x], together with the zero polynomial, show that V is a 
subspace of F[x]. 


12.3. Let V be a vector space having subspaces U and W. Show that UN W isa 
subspace of V. Extend this to the intersection of an arbitrary collection of subspaces. 


12.4. Let V be a vector space having subspaces U and W. Show that U + W 
(regarding U and W as additive subgroups of V) is a subspace of V. 


12.5. Let V and W be vector spaces over a field F. A function a : V > W is said 
to be a linear transformation if w@(v; + v2) = @(v,) + @(v2) and a(av;) = aa(v,) for 
alla € F,v1, v2 € V. If U is a subspace of V, show that a(U) is a subspace of W. 


12.6. Let F, V, W and a@ be as in the preceding exercise. Show that the kernel of 
a (regarding a@ as a homomorphism of additive groups) is a subspace of V. Further 
show that @ is one-to-one if and only if the kernel is {0}. 


12.7. Let F = Z), and V = F?. If W = {(a,b,c) € V: 2a+ 3b 4+ 7c = 0}, is W 
a subspace of V? 


12.8. Let F be a field with vector spaces V and W. Let U = V x W be the direct 
product of the additive groups V and W. Define a scalar multiplication on U via 
a(v, w) = (av, aw) for alla € F,v € V andw e€ W.Is U a vector space over F? 


12.9. Let F be a field of characteristic 3 and V a vector space over F’. Show that 
v+v+v=Oforallve F. 


12.10. Suppose that V is a vector space over an infinite field F. Show that V is not 
the union of a finite number of proper subspaces. 


12.2 Basis and Dimension 


In order to define a basis for a vector space, we must first discuss linear combinations 
of vectors. 


Definition 12.4. Let V be a vector space over a field F’. If vj, v2,..., ve € V, then 
a vector v € V is said to be a linear combination of the v; if v = ayv) +--- + axv,, 
for some a; € F. 


Example 12.9. Let F = Qand V = F?. If v; = (2, —3, 7) and v2 = (4, 0, 1), then 
(24, —6, 19) is a linear combination of v; and v2, since (24, —6, 19) = 2vy + 5yo. 


12.2 Basis and Dimension 211 


Definition 12.5. Let F bea field and V a vector space over F. Let v1, v2,..., ve € V. 
We say that the v; are linearly dependent if there exist a), ..., a, € F, not all zero, 
such that av) +--+: + a,v, = 0. Otherwise, the v; are linearly independent. 


Example 12.10. Let F = Zs and V = F?. The vectors (2, 1,3), (1,3, 0) and 
(2, 1, 4) are linearly dependent, since 3(2, 1, 3) + (1,3, 0) + 4(2, 1, 4) = (0, 0, 0). 
On the other hand, (1, 0, 4), (3, 2, 1) and (2, 0, 2) are linearly independent. Indeed, 
if a, (1, 0, 4) + ao@, 2, 1) + a3(2, 0, 2) = (0, 0, 0), then looking at the middle entry, 
we see immediately that ag = 0. Then a; + 2a, = 4a; + 2a3 = 0. This yields 
3a, = 0, and hence a; = 0 and, finally, az = 0. 


Here is a handy test for linear dependence. 


Theorem 12.3. Let V be a vector space over a field F and v,,...,v% € V. Then 
the v; are linearly dependent if and only if either 


I. vy, = 0; or 
2. there exists anm > 2 such that Vv, is a linear combination of v,, ..., Vm—1- 


Proof. Suppose that the v; are linearly dependent. Choose a; € F, not all zero, such 
that ajv; +---+axv, = 0. Let m be the largest positive integer such that a,, 4 0. 
Then ajvj +--+ + Gm_V¥m = 0. If m = 1, then ajv; = 0, with a; ~ 0. Thus, 
vy, = a;,'0 = 0, giving case (1).Ifm > 1, then vy, = —az!aivi—---—ay!}dn—1Vm—1; 
and so v,, is a linear combination of v1, ..., ¥n—1, which proves case (2). 

Conversely, suppose that (1) or (2) is satisfied. If v; = 0, then 1vy + Ov2 +---+ 
Ov, = 0, meaning that the v; are linearly dependent. If v,, = bivy) +---+D0m—1Vm-1, 
for some b; € F, then 


byvy a Dm—1Vm-1 _ lv 7 OVin+1 Eh Ov, = 0. 


Again, the v; are linearly dependent. 


Linear independence is most useful when combined with another property. 


Definition 12.6. Let V be a vector space over a field F,, and let v),..., 4 € V. 
Then we say that the v; span V if every v € V is a linear combination of the v;. 


Example 12.11. Regarding C as a vector space over R, we note that 1 andi span C, 
asa+bi =al+ bi. 


Example 12.12. Let F = R and V = R?. Then the vectors (1, 0, 0), (0, 1, 0) and 
(0, 0, 1) span V, since (a, b, c) = a(1, 0, 0) + b(O, 1, 0) + c(0, 0, 1). 


The following lemma describes a very nice relationship between linear indepen- 
dence and spanning. 


Lemma 12.1. Let V be a vector space over a field F. Suppose that v,, ..., Vg span 
V.Ifwi,...,wi € V, andl > k, then the w; are linearly dependent. 
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Proof. Since the v; span V, we know that w is a linear combination of the v;. Let 
us say that wy = a1vj +--+ + avy, with a; € F. If all of the a; are zero, then w, 
is the zero vector. Thus, by Theorem 12.3, we are done. Therefore, we may assume 
that some a; is nonzero. Without loss of generality, say a; 4 0. We now observe that 


W1, V2, V3,---, Ve span V. Indeed, if v € V, then v = byv; + --- + Devg, for some 
b; € F. But 

y= a, wi =— a; ‘anv See Ay | AKVE. 
Thus, 


v = bay! wi + (b2 — diay 'an)v2 +++» + (be — diay ‘ax)ve, 


proving the claim. 

Now consider w. It is a linear combination of w1, v2, v3, ..., vz. Let us say that 
W2 = CW + Cov2 + 63V3 +--+ + ceVvg, With c; € F.Ifc; = 0 for alli > 2, then 
W2 is a linear combination of w,, proving that the w; are linearly dependent. Thus, 
we may assume that there exists ani > 2 with c; 4 0. Without loss of generality, 
say cz ~ 0. But then vp is a linear combination of w1, wo, v3, v4, ..., Ve. And just as 
before, we now deduce that w1, w2, V3, v4,..., Ve span V. 

Repeat this argument. We will conclude either that the w; are linearly dependent 
or, eventually, that w,,..., wx span V. But then w;z4, is a linear combination of 
W1,---, We. By Theorem 12.3, the w; are linearly dependent. 


What we really need is a basis for a vector space. 


Definition 12.7. Let V be a vector space over a field F’. We say that v,,...,v, EV 
form a basis for V if they are linearly independent and span V. 


Example 12.13. Regarding C as a vector space over R, we can see that 1 andi form 
a basis for C. 


Example 12.14. For any field F and any positive integer n, the vectors 
(1, 0,0,...,0), (0, 1,0,0,...,0),...,(0,0,...,0, 1) 


form a basis for F'”. 


Example 12.15. Let F be any field and V = F[x]. Then V has no finite basis. 
Indeed, if v;,...,v% € V, then any linear combination of these vectors must have 
degree no larger than the maximum of the degrees of the v;. On the other hand, for 
any positive integer n, let W be the set of all polynomials having degree at most n 
(including the zero polynomial). By Exercise 12.2, W is a subspace of V, and the 
polynomials 1, x, x?,...,x" form a basis. 


Theorem 12.4. Let V be a vector space over a field F. If v,,..., ve form a basis 
for V, then every element of V can be written uniquely in the form a,v, + +--+ 4kVx, 
with a; € F. 
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Proof. Since a basis spans the space, only the uniqueness needs to be proved. Suppose 
that 

QVy Hee + ayy = Div, + +++ + Deg, 
with a;, b; ¢ F. Then 


(a, — by)vy + +++ + (ag — Dg) vy = 0. 


By linear independence, a; = b; for all i. 


Bases are not unique. For instance, (1, 0) and (0, 1) form a basis for R? over R, 
but so do (1, 3) and (5, 2). However, any two bases for a vector space must have the 
same number of vectors. 


Theorem 12.5. Let V be a vector space over a field F. If v,,..., Vv, and w,..., WI 
are bases for V, then k = 1. 


Proof. Suppose the theorem is false. Without loss of generality, say k < /. Then 
V1,---, Ve span V. Since k < J, Lemma 12.1 tells us that w;,..., w; are linearly 
dependent. We have a contradiction. 


Definition 12.8. Let V be a vector space over a field F. If vj, ..., vg is a basis for 
V, then we say that V has dimension k, and write dim V = k (or dime V = k, if 
the field is unclear from the context). We also stipulate that dim{0} = 0. In either 
of these cases, V is finite-dimensional. If V has no finite basis, then V is infinite- 
dimensional. 


Example 12.16. For any field F and positive integer n, dim F” = n. See Example 
12.14. 


Example 12.17. The dimension of C over R is 2. See Example 12.13. 


Example 12.18. If F is any field, then F'[x] is infinite-dimensional. The vector space 
consisting of the polynomials of degree at most n over F, including the zero poly- 
nomial, has dimension n + 1. See Example 12.15. 


In a finite-dimensional space, we can discard vectors from a spanning set to obtain 
a basis, or add vectors to a linearly independent set to obtain a basis. 


Theorem 12.6. Let V be any vector space over a field F, with V 4 {0}. Take 
V1,---, Ve € V. Then 


I. ifvy,..., Vx span V, then some subset of {v1, ..., Vg} is a basis for V; and 
2. if v1,..., Ve are linearly independent, and dim V = n < ov, then there exist 
Vi+ls +++» Yn © V such that v\,...,V,_, forma basis for V. 
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Proof. (1) We proceed by induction on k. If k = 1, then since v; spans V, and 
V & {0}, we see that vy; 4 0, and hence v, is linearly independent. (If av; = 0, and 
04a € F,then0 = a~'av, = vy.) Thus, v; is a basis. Suppose the result is true for 
k,andletv,,..., ve41 span V. If they are linearly independent, there is nothing to do. 
Otherwise, refer to Theorem 12.3. If vj = 0, then v2,..., v441 span V as well. By 
our inductive hypothesis we are done. Otherwise, some v; is a linear combination of 
V1,---, ¥i-1- Without loss of generality, say] = k+1. Write ve4) = avi t---+agvg. 
If v € V, we know that 
v= dbyyy +--+ + bei Vest, 


for some b; € F. But then 


V= (bi + ay bey )vy H+ + Oe + ade) VE- 


Thus, vj,..., v¢ span V. Our inductive hypothesis completes the proof. 

(2) If v1, ..., ve span V, there is nothing to do. Otherwise, find v4; € V which 
is not a linear combination of vj,..., ve. Suppose that v;,...,ve41 are linearly 
dependent. Then a,v) +--+ + G¢41Ve41 = 0, for some a; € F. If agi, = O, then 
v1,---, Vx are linearly dependent, which is not the case. Otherwise, 

_ -1 -l . 

Ve+1 = Tap {AVI SS D4 AkVks 
that is, vg+1 is a linear combination of v;,..., vz, giving us a contradiction. There- 
fore, v;,..., Ve+1 18 a linearly independent set. Now repeat. This process must stop, 


because Lemma 12.1 tells us that V cannot have a linearly independent set with more 
than n vectors. 


Example 12.19. Let F = Q and V = Q’. The vectors (3, —7, 0) and (1, 2, 0) 
are easily seen to be linearly independent. Furthermore, (2,5, 8) is not a linearly 
combination of these two vectors. Thus, since dim V = 3, we see that the vectors 
(3, —7, 0), C1, 2, 0), (2,5, 8) form a basis for V. 


Exercises 


12.11. Let F = R and V = R’. Are the following sets of vectors in V linearly 
dependent or independent over F'? 


1. (1,3, 5), (2, 1,4), (7, 11, 23) 
2. (1,3, 4), (2, 2, 1), (3, 6, 3) 


12.12. Let F = Z, and V = M)(F). Are the following sets of vectors in V linearly 
dependent or independent over F'? 


 (43)-(05) (33) 
2» (34) (as) Go) 
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12.13. Do the following vectors span Q? (as a vector space over Q)? 


1. (1,0, 2), 2,5, 3), G, 5, 5) 
2. (1, 0, 2), (2, 3, 5), (0, 0, 4) 


12.14. Do the following matrices span Mz(Zs) (as a vector space over Zs), namely 
11 01 00 LO Vs, 
OO}? \L0O/? ALI)? VO): 


12.15. Let V = M2(C). Find the dimension of V as a vector space over C, and as 
a vector space over R. 


12.16. Let F = Z, and V = {(a, b,c) € F? : c = 3a + 5b}. Find the dimension of 
V over F. 


12.17. Let F bea field and V a finite-dimensional vector space. If W is a subspace 
of V, show that dim W < dim V, with equality if and only if W = V. (Do not 
assume, to begin with, that W is finite-dimensional.) 


12.18. Suppose that a vector space V with dimension n has subspaces U and W 
with dimensions m and k, respectively. If m + k > n, show that UM W 4 {0}. 


12.19. Let F, V, W and a be as in Exercise 12.5. Suppose that v;,...,v, € V are 
linearly independent and q@ is one-to-one. Show that a(1),..., @(v,) are linearly 
independent. 


12.20. Let F bea field and V a finite-dimensional vector space over F'. Say dim V = 
n € N. Show that there exists a bijective linear transformation (see Exercise 12.5 for 
the definition)a:V — F”. 


12.3 Field Extensions 


Let us now focus on our main vector space of interest: the field extension. 


Definition 12.9. Let K be a field extension of F. Then the degree of the extension 
is the dimension of K over F. We write [K : F'] = dim p K. The extension is finite 
if [K : F] < o and, in particular, quadratic if [K : F] = 2. 


Example 12.20. As we observed in Example 12.17, C is a quadratic extension of R. 


Example 12.21. Let K = {a + bv/2 + cW/4 : a,b,c € Q}. We claim that K is 
a subfield of R and, therefore, an extension field of Q. All of the properties of a 
subfield are easy to verify except, perhaps, that nonzero elements have inverses. 
Take 0 £ a+ b\/2 + cV/4 € K. Then notice that (a + bx + cx”, x3 — 2) divides 
x? — 2. But, by Example 11.10, x? — 2 is irreducible over Q. Thus, the gcd can only 
be a constant polynomial (in fact, 1, since we assume it to be monic). As Q[x] is a 
Euclidean domain, Theorem 10.6 guarantees that we can write 
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1 = u(x)(x? — 2) + v(x)(a + bx +c’), 


for some u(x), v(x) € Q[x]. But then 1 = v(\/2)(a + bv/2 4+ cv/4). As it is easy to 
see that v(/2) € K, we have an inverse for a + bv/2 + cv/4 in K, as claimed. 

In fact, [K : Q] = 3. To see this, we observe that {1, 2, /4} is a basis for K 
over Q. Clearly, these numbers span K.. If they are linearly dependent, then there are 
rational numbers a, b, c, not all zero, such that /2 is aroot of a+bx +cx’. But again, 
1 = (a+bx+cx’, x? —2), and we write 1 = u(x)(a+bx +cx?)+ v(x)(x? —2), for 
some u(x), v(x) € Q[x]. Evaluating at /2, we obtain 1 = 0, giving a contradiction 
and establishing that we have a basis. 


We are, in fact, engaging in a small abuse of notation here. If K is an extension 
field of F then, of course, F is also an additive subgroup of K. We could also use the 
notation [K : F] to mean the index of F in K as additive subgroup. This is not the 
same as the degree of the extension! For the remainder of the book, when we write 
[K : F], we will mean the degree of the extension. 

In the particular case of a finite field, we can illustrate the difference. By 
Lagrange’s theorem, the index of the additive groups would be ne However, the 
degree is calculated as follows. 


Theorem 12.7. Let K be a field extension of F, such that K is a finite field. Then 
[K : F] = log), |K\. 


Proof. First, we note that K must be finite-dimensional over F’. Indeed, the elements 
of K must span K, and by Theorem 12.6, we can obtain a finite basis. Let[K : F] =n, 
and suppose that {v,,..., v,}is a basis for K over F. By Theorem 12.4, the elements 
of K are uniquely of the form a,v; +---+a,v,, witha; € F. As there are | F'| choices 
for each a;, the total number of elements of K is | F'|”. Taking the base | F'| logarithm, 
we obtain our result. 


Degrees of extensions behave in a nice way. 


Theorem 12.8. Let K be a finite extension of F and L a finite extension of K. Then 
[L: F]=[L: K][K: F]. 


Proof. Let {v,,...,V,} be a basis for K over F, and let {w,,..., W»} be a basis for 
L over K. We claim that {vjw; : 1 <i <n,1< j < m} isa basis for L over F. 
This will complete the proof. 

Take any / € L. Then/ = ayw; + --- + dnWm, for some a; € K. But a; = 
div) +-++++ binVn, for some bj; € F. Thus, 


l= byv wy oC Di2V2W Sea DinVaWy See Dnt ViWn aes oe DinnVnWm- 


That is, the v;w; span L over F’. Suppose that they are linearly dependent. Then there 
exist bj; € F, not all zero, such that 
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0= buviwy ea DinVaW ei Dini ViWm se DinnVnWm 
= (bv Be Race DinVn)W1 eer (Om1V1 ae a DinnVn)Wm- 


As each bj;v; + +--+ Dinvn € K, and the w; are linearly independent over K, we 
have bjjv) +--+ + Dinva = O, for all i. But the b;; € F, and the v; are linearly 
independent over F’. Thus, all of the b;; are zero. The proof is complete. 


Example 12.22. Let K = {a+b/2: a, b € Q}. By Example 8.30, K is an extension 
field of Q. Clearly, 1 and /2 span K over Q. If they were linearly dependent, then 
/2 would lie in Q, which is not the case. Thus, [K : Q] = 2. Let L = {c +dvV3: 
c,d € K}. We claim that L is a subfield of R and, hence, an extension of K. All 
of the subfield properties are easy to check except perhaps the existence of inverses. 
Let 0 £4 c+ dvV3 € L. Then (c + dV3)(c — dV3) = c? — 3d. Suppose that 
this is 0. Then c — dV3 = 0. If d = 0, then so is c, giving us a contradiction. 
Otherwise, /3 =cd"e K. Thus, we can write a + b/2 = af. witha,be Q. 
Then a? +2b?+2ab/2 = 3. If b = 0, then /3 = a € Q, whichis not true. Ifa = 0, 


then /3 = b € Q. But then 2x? — 3 has a rational root which, by Theorem 11.5, 


is not the case. Thus, ab 4 0, and J/2€ Q, giving us a contradiction. Therefore, 


(c+ d/3)"! = pane € L. Now, | and 3 span L over K. If they were linearly 


dependent, then we would have /3 € K which, as we have just seen, is not the case. 
Therefore, [L : K] = 2. By the theorem above, [L : Q] =[L: K][K : Q] =4. 


One particular type of extension is especially important. 


Definition 12.10. Let K bea field extension of F. If a € K, then we write F(a) for 
the intersection of all subfields of K containing F anda. We say that K is a simple 
extension of F if K = F(a) forsomea € K. 


By Exercise 8.33, the intersection of some set of fields is a field. Thus, F(a) is 
always a field. Indeed, it is the smallest subfield of K containing F and a. 


Example 12.23. By Example 8.30, {a + bV/2 : a, b € Q} is a subfield of R. Thus, 
since any field including Q and J/2 would surely contain this field, it is Q(V/2). 


Example 12.24. Ina similar manner, we note that Q(/2) would have to contain «/2 
and (./2)?. Example 12.21 shows us that Q(./2) = {a+ bV2+cV/4: a,b,c € Qh. 


Let us concentrate on simple extensions. In fact, we need to break them down into 
two types, depending upon one specific property of the element a. 


Definition 12.11. Let K be a field extension of F anda € K. We say that a is 
algebraic over F if there exists anonzero polynomial f(x) € F[x] such that f(a) = 
0. Otherwise, a is transcendental over F. 


Example 12.25. The number V2 is algebraic over Q, since it is a root of x7 — 2 € 


Qi]. 
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Example 12.26. The number /2 + V3 is algebraic over Q, since it is a root of 
x*— 10x? + 1. 


Finding examples of real numbers that are transcendental over Q is a bit tricky. As 
it happens, the constants e and z are both transcendental. (This is a difficult result. 
For a proof, see the advanced monograph of Baker [1].) Of course, the underlying 
field is important! If we let F = Q(z), then z is algebraic over F, as 7 is a root of 
x? — mn? € Fix]. 

We are primarily interested in algebraic elements. However, we can mention one 
important fact about transcendental elements. If F is a field, then F'[x] is an integral 
domain, and so we can consider its field of fractions. Denote this field of fractions 
by F(x). 


Theorem 12.9. Let K be an extension field of F, and let a € K be transcendental 
over F. Then F(a) is isomorphic to F(x). In particular, F(a) is of infinite degree 
over F. 


Proof. Define a: F[x] — K viaa(f(x)) = f(a). By Lemma 11.1, a is a homo- 
morphism. If f(x) € ker(@), then f(a) = 0. Since a is transcendental, f(x) is the 
zero polynomial. Thus, a is one-to-one, and F[x] is isomorphic to a(F[x]). Also, 
f(a) € F(a) for all f(x) € F[x]; thus, a(F[x]) is a subring of F(a). By Theorem 
9.15, there is a subfield L of F(a) such that L is isomorphic to F(x) and contains 
a(F[x]). Clearly a(b) = b for all b € F and a(x) = a; thus, a(F[x]) contains 
both F and a. But F(a) is the smallest subfield of K containing both F and a; thus, 
F(a)=L. 

Suppose that [F(a) : F] =n < oo. Then according to Lemma 12.1, the elements 
1,a,a’,..., a" must be linearly dependent over F’. But then there exist c; € F, not 
all zero, such that a is a root of cp + cix +--+ + c,x". That is, a is algebraic, giving 
us a contradiction. 


Now suppose that a is algebraic over F. We know that it satisfies a nonzero 
polynomial in F [x]. But one particular such polynomial is key. 


Definition 12.12. Let K be an extension field of F and let a € K be algebraic over 
F. Then the minimal polynomial of a over F is the monic irreducible polynomial 
m(x) € F[x] such that m(a) = 0. 


Example 12.27. The minimal polynomial of \/2 over Q is x* — 2. Indeed, V2 is a 
root, and the polynomial is irreducible by Example 11.10. 


Example 12.28. The minimal polynomial of /2 + /3 over Q is x* — 10x? + 1. 
As we noted in Example 12.26, /2 + V3 is a root. Suppose it were reducible over 
Q. The Rational Roots Theorem shows us that it has no roots in Q. Thus, it would 
have to factor as a product of two polynomials of degree 2. By Theorem 11.4, these 
polynomials may be assumed to be in Z[x]. Looking at the coefficients, we see 
immediately that (up to multiplying both factors by —1) the only possibilities are 
(x? + ax + 1)(x? — ax + 1) and (x? +. ax — 1)(x? — ax — 1), for some a € Z. But 
then 2 — a* = —10 or —2 — a? = —10. Neither of these has a solution in Z. 
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We were a bit bold in our definition of the minimal polynomial. Indeed, we 
assumed that such a polynomial exists, and that there is only one. Fortunately, our 
presumptuousness was justified; in fact, we can say more. 


Theorem 12.10. Let K be an extension field of F, and leta € K be algebraic over 
F.. Then 


I. the minimal polynomial m(x) of a over F exists, and is the unique monic poly- 
nomial of smallest degree in F[x] of which a is a root; and 


2. if f(x) € F[x], then f(a) = 0 if and only if m(x)| f (x). 


Proof. Let I = {f(x) € F[x] : f(a) = O}. We claim that J is an ideal of F[x]. 
Surely 0 € J. If f(x), g(x) € J, then f(a) — g(a) = 0, and hence f(x) — g(x) € I. 
Also, if h(x) € F[x], then f(a)h(a) = 0, and hence f(x)h(x) € J, proving the 
claim. 

We know that F[x] is a Euclidean domain and hence, by Theorem 10.8, a PID. 
Thus, let J = (m(x)). Since a is algebraic, m(x) is not the zero polynomial. As 
(m(x)) = (cm(x)) if 0 ~ c € F, we may as well assume that m(x) is monic. 
Now, f(x) € J if and only if m(x)| f(x), as required by (2). As such, deg(m(x)) < 
deg f(x), unless f(x) = 0. If deg(m(x)) = deg(f(x)), then f(x) is simply m(x) 
multiplied by an element of F’. If f (x) is also monic, then f(x) = m(x). Thus, m(x) 
satisfies condition (1) as well. 

We must still establish that m(x) is actually the minimal polynomial of a over F. 
To demonstrate this, we must show that m (x) is irreducible. Butifm(x) = f(x)g(x), 
with f(x), g(x) € F[x], then 0 = m(a) = f(a)g(a). Thus, f(a) = 0 or g(a) = 0. 
Without loss of generality, say f(a) = 0. Then m(x)| f(x). But also f(x)|m(x). It 
now follows that deg( f (x)) = deg(m(x)), and hence g(x) is a constant polynomial. 
Thus, m(x) is irreducible, and hence a minimal polynomial for a. If g(x) is another 
minimal polynomial, then g(x) € J, and hence m(x)|g(x). But g(x) is irreducible, 
and therefore g(x) = cm(x) for some c € F. As m(x) and g(x) are both monic, 
m(x) = g(x), and the proof is complete. 


We can use the minimal polynomial to describe the simple extension. 


Theorem 12.11. Let L be an extension field of F, and let a € L be algebraic over 
F.. If m(x) is the minimal polynomial of a over F, letn = deg(m(x)). Then 


I. (F(a): Fl =n; 
2. {l,a, a*,..., a"—Visa basis for F(a) over F; and 
3. F(a) is isomorphic to F[x]|/(m(x)). 


Proof. Of course, (1) follows immediately from (2), so let us prove (2). Suppose 
that 1,a,...,a"~' are linearly dependent. Then there exist cg, ..., Cn_1 € F, notall 
zero, such that cp + cja +--+ +c,—1a"~! = 0. That is, a is aroot of cp +cyx +--+ -+ 
Cn_1x" |. But this polynomial has degree smaller than that of m(x), contradicting 
Theorem 12.10. Thus, 1,a,...,a”~! are linearly independent. 

We claim that they span F(a). We know that F(a) is the smallest field containing 
F and a (and, hence, all of the a‘). Therefore, it is sufficient to show that K = 
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{co tc1a +--+ + c,_-1a""! 1c, € F} isa field. Clearly, it contains | and is closed 
under subtraction. To show that it is closed under multiplication, it is enough to 
show that a' € K for all positive integers i. Our proof is by strong induction upon 
i. Ifi <n, there is nothing to do, so let i > n and suppose it is true for smaller 


n,i-n 


exponents. Writing m(x) = bo + --- + by_yx""'! + x", we have a’ = a = 
(—bo — bia — +++ — by_,a""')a‘—". But this is a linear combination of terms of 
the form a/, with j < i. Thus, by our inductive hypothesis, a’ € K. Finally, we 
must check that every nonzero element of K has an inverse in K. But a nonzero 
element of K has the form f(a), for some 0 4 f(x) € F[x], with deg(f(x)) <n. 
Now, (f(x), m(x))|m(x). As m(x) is irreducible, (f(x), m(x)) is either 1 or an 
associate of m(x). However, deg(f (x)) < deg(m(x)). Thus, (f(x), m(x)) = 1. By 
Theorem 10.6, there exist u(x), v(x) € F[x] such that f(x)u(x) + m(x)v(x) = I. 
Since m(a) = 0, we have f(a)u(a) = 1. Furthermore, as we noted above, u(a) € K, 
so f(a) has an inverse in K. Therefore, K is a field, and (2) is proved. 

(3) Define a : F[x] — F(a) viaa(f(x)) = f(@. By Lemmal1.1, @ is a 
homomorphism. In view of (2), it is onto. The kernel is the set of all polynomials 
in F[x] of which a is a root. By Theorem 12.10, this is (m(x)). Apply the First 
Isomorphism Theorem. 


Example 12.29. As x? + 1 is the minimal polynomial of i over R, we see that 
C = R() is isomorphic to R[x]/(x* + 1). 


Example 12.30. As we saw in Example 12.28, the minimal polynomial of /2 + /3 
over Q is x* — 10x? + 1. Therefore, Q(./2 + J/3) is isomorphic to 


Q[x]/(x* — 10x? + 1). 


Furthermore, letting a = eee V3, the elements of Q(a) are precisely co + cja + 
oa* + c3a?, with c; € Q. Addition works in the obvious way. To demonstrate 
multiplication, let us try (2 — 3a + 4a”)(5 + a — 6a? + 2a). We get 10 — 13a + 
5a? + 26a? — 30a* + 8a°. Now, a* = 10a” —1 anda® = 10a? —a. Thus, our product 
is 40 — 21a — 295a + 106a3. 


Our last theorem has an interesting immediate consequence. 


Corollary 12.1. Let K be an extension field of F. Ifa, b € K, anda and b have the 
same minimal polynomial over F, then F(a) is isomorphic to F (b). 


Proof. If m(x) is the minimal polynomial, then by Theorem 12.11, both F(a) and 
F(b) are isomorphic to F[x]/(m(x)). 


Example 12.31. Let w be a primitive cube root of unity in C. (That is, #* = 1 but 
@ # 1.) Then 2 and w,/2 are both roots of x3 — 2 € Q[x]. As we have observed, 
x> — 2 is irreducible over Q, so it is the minimal polynomial of both ¥/2 and w,/2. 
Thus, Q(./2) is isomorphic to Q(wV/2). These fields are clearly distinct, as Q(/2) 
is a subfield of R, but w/2 ¢ R. 
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Exercises 


12.21. Find the minimal polynomial of V5 + V7 over Q. 
12.22. Find the minimal polynomial of /3 + /9 over Q. 


12.23. Let K be a finite extension field of F. Show that every element of K is 
algebraic over F’. 


12.24. Let K be an extension field of F and L an extension field of K. Ifa € L is 
algebraic over F’, show that [K (a): K] <[F(a): F]. 


12.25. Suppose that we have subfields F,, of K with F; C Fy C F3 C---. Show 
that J, Fy, is a field. 


12.26. For each positive integer n, let a, = V2.1 K = cae Q(a,), show that K 
is an infinite field extension of Q, but every element of K is algebraic over Q. 


12.27. Let K bea field extension of F. Show that for every a € K, F(a”) C F(a). 
Also, give an explicit example illustrating that we do not have F(a”) = F(a) in 
general. 


12.28. Let K bea field extension of F anda € K. Show that a is algebraic over F 
if and only if a? is algebraic over F. 


12.29. Let K be an extension field of C. If a € K is algebraic over C, show that 
aéeC. 


12.30. Let K be a finite extension of F’. If R is a subring of K containing F’, show 
that R is a field. 


12.4 Splitting Fields 


Let us now take a slightly different perspective from the preceding section. Given a 
field F’, instead of looking at elements of extension fields and finding their minimal 
polynomials, let us instead take a nonconstant polynomial f(x) € F[x] and see if 
we can find a field containing F and a root of f(x). For instance, suppose that we 
only knew about the rational numbers, and we wanted to construct a field having a 
root of x? — 2. 


Definition 12.13. Let F bea field and let f(x) € F[x] be anonconstant polynomial. 
If K is an extension field of F, then we say that f(x) splits over K if there exist 
a,a\,...,4, € K such that f(x) = a(x — a,)--- (x —a,). In particular, K is a 
splitting field for f(x) if f(x) splits over K, and if L is any subfield of K with 
F CLC K, then f(x) does not split over L. 
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To put this another way, if K is an extension field of F and bj,...,b, € K, write 
F(b,,...,b,) for the intersection of all subfields of K containing F and all of the 
b;. If f(x) € F[x] splits over K, and aj,..., a, are the roots of f(x) in K, then K 
is a splitting field of f(x) if and only if K = F(qj,..., dy). 

But how to construct such a field? The following observation is helpful. 


Lemma 12.2. Every nonzero prime ideal in a PID is maximal. 


Proof. Let I be a nonzero prime ideal in a PID R. Then J = (a), forsomea € I. By 
Lemma 10.2, a is prime. In particular, by Theorem 10.10, a is irreducible. Since J is 
prime, J # R. Suppose that J is an ideal of R with J C J € R. Let J = (b). Then 
a € (b), so bla. As ais irreducible, b is a unit or an associate of a. In the former case, 
J = R. In the latter, a|b, and hence J = /. Either way, we have a contradiction. 


The next lemma is the key to our construction. 


Lemma 12.3. Let F be a field and f(x) an irreducible polynomial in F [x]. Let 
K = F[x]/(f(a)). Then K is a field containing (an isomorphic copy of) F and a 
root a of f (x). In fact, K = F(a). 


Proof. We know that F[x] is a Euclidean domain and hence, by Theorem 10.8, a 
PID. By Theorem 10.11, f(x) is prime. Thus, by Lemma 10.2, (f(x)) is a prime 
ideal. The preceding lemma tells us that (f (x)) is maximal. By Theorem 9.20, K is 
indeed a field. Definea : F > K viaa(b) = b+ (f (x)). It is immediate that a is a 
homomorphism. If a(b) = 0, then b € (f(x)), which means that f(x)|b. As bisa 
constant, b = 0, and hence a is one-to-one. Thus, K contains an isomorphic copy of 
F, namely a(F). Finally, let us show that K contains a root of f(x). But this root is 
a =x+(f(x)). Indeed, f(a) = f(x) + (f(x) = 0+ (f(a), as required. Clearly, 
F(a) would have to contain x” + (f(x)) for alln > 0. Thus, K = F(a). 


Let us combine the preceding lemma with Theorem 12.11. We see that if f(x) € 
F [x] is irreducible of degree n, then the field K has, as a basis over F,, the terms 
x! + (f(x)), with 0 < i < n. This allows us for the first time to create finite fields 
other than Z,, where p is a prime. 


Example 12.32. Suppose we wish to construct a field of order 125. In view of 
Theorem 12.7, we would need an extension of degree 3 of Zs. Consider f(x) = 
x3 4 3x2 +x +42 € Zs[x]. By Corollary 11.2, it is irreducible over Zs if it has 
no roots in Zs. There are only five possible roots, and none of them work. There- 
fore, f (x) is irreducible and F'[x]/(f(x)) is a field of order 125. The elements are 
ao + ayx 4 anx? + (f (x)), with a; € Zs5. Addition works in the obvious way. As an 
example of multiplication, we have (letting J = (f(x))) 


(24+ 4x4+ 3x7 + D1+4e4+ 1) =24+2x4+ 40742747 


94-0 ae $93 Se =D 
= 343x741. 
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We can now construct splitting fields. 


Theorem 12.12. Let F be afield and f(x) € F(x] anonconstant polynomial. Then 
there is a splitting field of f (x) over F. 


Proof. First, let us prove the existence of a field extension in which f(x) splits. We 
proceed by induction on n = deg(f(x)). If nm = 1, then F will suffice. Assume 
that n > 2 and the n — 1 case holds. We know that F[x] is a UFD. Thus, write 
f(x) = g1(%)--- gx(x), where the g;(x) are irreducible in F[x]. By Lemma 12.3, 
there is an extension field K of F in which gj (x) has aroot, a. Then by Theorem 11.2, 
gi(x) = (x — a)h,(x), for some h(x) € K[x]. Thus, in K[x], we have f(x) = 
(x — a)hy(x)g2(x) -- + gg (x). Now, hi (x) g2(x)--- gx (x) has degree n — 1. Thus, by 
our inductive hypothesis, it splits in some extension field L of K. Hence, L is an 
extension field of F, and f(x) splits over L. 

Let us write f(x) = b(x — bj)---(x« — by), with b,b),...,b, € L. Then 
F(b,,..., bn) is a splitting field for f(x) over F. 


But we can go one step further. We want to show that splitting fields are unique 
up to isomorphism. (The proof is a bit technical, but the result will pay dividends 
when we classify the finite fields.) To this end, we need to sharpen Corollary 12.1 a 
bit. Ifa : R — Sis aring homomorphism, and f(x) = co +--+ + cnx” € R[x], 
then we write a(f(x)) = a(co) +--- + a(cy)x” € S[x]. 


Lemma 12.4. Leta: F — K be an isomorphism of fields. Let f (x) € F[x] be 
an irreducible polynomial. Suppose that a is a root of f (x) in some extension field 
of F and b is a root of a(f (x)) in some extension field of K. Then there exists an 
isomorphism B : F(a) — K(b) such that B(c) = a(c) for allc € F and B(a) = b. 


Proof. Define y : F[x] — F(a) via y(g(x)) = g(a). By Lemma 11.1, y is a 
homomorphism. By Theorem 12.10, ker(y) = (f(x)). (We assumed that f(x) was 
monic in that theorem, but that is immaterial here.) In view of Theorem 12.11, y 
is onto. Thus, the proof of the First Isomorphism Theorem shows us that the map 
p: F[x]/(f(@)) — F(a) given by p(g(x) + (f(*))) = g(a) is an isomorphism. 
We also note that if c € F, then p(c + (f(x))) = c and p(x + (f(x))) = a. 
In precisely the same manner, the map t : K[x]/(a(f(x))) — K(b) given by 
t(h(x) + (a(f(x)))) = h(b) is an isomorphism, t(d + (a(f(x)))) = d for all 
dé K andt(x+ (a(f(x)))) =b. 

Now, the function from F [x] to K[x] mapping each u(x) to w(u(x)) is easily seen 
to be an isomorphism. Composing that with the obvious homomorphism from K [x] to 
K[x]/(a@(f («))), we obtain a homomorphism from F[x] onto K[x]/(a@(f (x))) with 
kernel (f (x)). In view of the First Isomorphism Theorem, we have an isomorphism 


o : Fl[x)/(f(x)) > K[x]/(@(f(x))) given by 
a (u(x) + (f(x))) = a(x) + (@(f (*))). 


Notice that o(c + (f(x))) = a(c) + (a(f (x))) for alle € F, ando(x + (f(x))) = 
x + (a(f(x))). 
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From Theorem 9.12, we learn that top! 


Furthermore, if c € F’, then 


: F(a) — K(b) is an isomorphism. 


top '(c) = toc + (f(x))) = tac) + @(f()))) = ac), 


and 


top ‘(a) = to(x + (f(x))) = te + (@(f (a) = b. 


1 


Letting 8 = top“, we are done. 


This allows us to prove the uniqueness of splitting fields. 


Theorem 12.13. Leta: F — K bea field isomorphism, and let f(x) € F[x] bea 
nonconstant polynomial. If L is a splitting field of f (x) over F, and M is a splitting 
field of a(f (x)) over K, then there is an isomorphism B : L — M such that B and 
a agree on F. 


Proof. We proceed by induction on n = deg(f (x)). Ifn = 1, then we can only have 
L= F andM = K. Thus, letting 6 = a will suffice. Assume that the result is true 
for polynomials of degree n — 1. As f(x) is a product of irreducibles in Fx], let 
us say that f(x) = g(x)h(x), where g(x) is irreducible and h(x) € F[x]. Let a be 
a root of g(x) in L and b a root of a(g(x)) in M. By the preceding lemma, there is 
an isomorphism y : F(a) — K(b) such that y agrees with a on F and y(a) = b. 
We have f(x) = (x — a)u(x), for some u(x) € F(a)[x], by Theorem 11.2. Also, 
y(f(x)) = & — y(a))y u(x) = (& — b)y (u()) in K (6) [x]. Now, L is a splitting 
field for u(x) over F(a) and M is a splitting field for y(u(x)) over K(b). Since 
deg(u(x)) = n — 1, our inductive hypothesis completes the proof. 


Corollary 12.2. Let F be a field and f (x) € F[x] anonconstant polynomial. Then 
any two splitting fields of f (x) over F are isomorphic. 


Proof. In the preceding theorem, let a : F — F be the identity automorphism. 


Exercises 


12.31. Construct an extension field F of Z7 having order 7°. In particular, if F = 
Z7(a), what do all of the elements of F look like? To which of these elements is 
(a? + 5a + 4)(3a? + 6) equal? 


12.32. Construct an extension field F of Z3 having order 81. In particular, if F = 
Z3(a), what do all of the elements of F look like? To which of these elements is 
(a? + 2a? + 2)(2a7 +a + 1) equal? 


12.33. Show that Q(/2, w) is a splitting field of x* — 2 over Q, where w € C, 
w = 1, but @ # i. 


12.34. Let F bea fieldand f(x) € F[x] anonconstant polynomial. If K isa splitting 
field of f(x) over F and L is any extension field of F, suppose thata : K > Lisa 
homomorphism satisfying a(c) = c for allc € F.Ifa € K is aroot of f(x), show 
that a(a) is also a root of f(x). 
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12.35. Find every automorphism of Q(/2). 


12.36. Construct a splitting field for x* + 2x + 1 over Z3. Show that it has degree 
3 over Zs. 


12.37. Let F be any field and f(x) € F[x] a nonconstant polynomial. If we let 
g(x) = f(x + 1), show that f(x) and g(x) have the same splitting fields over F. 


12.38. Let F be a field and f(x) € F[x] a polynomial with deg(f(x)) =n € N. 
Show that f(x) has a splitting field K over F with [K : F] <n!. 


12.5 Applications to Finite Fields 


Let us see what we can deduce about finite fields. If F is a finite field, we know that 
its prime subfield must be isomorphic to Z,, for some prime p. By Theorem 12.7, 
F must have order p”, for some positive integer n. We will construct a field of order 
p” and show that, up to isomorphism, there is only one such field. 

The following concept looks suspiciously like calculus, but is not. 


Definition 12.14. Let F bea field and f(x) = ap-+a,x +anx?+-+-+ay,x" € F[x]. 
Then the formal derivative of f(x) is f’(x) = a; + 2a.x +--+ + na,x""!. 


Note that this has nothing whatsoever to do with limits, as limits do not necessarily 
make sense in an arbitrary field. The formula happens to agree with the one used for 
the derivative of real polynomials. We will also not be disturbed by the fact that the 
following lemma extends the similarity to calculus. 


Lemma 12.5. Let F be a field, f(x), g(x) € F[x] anda € F. Then 
1. (af (x))' = af'(x); 

2. (f(x) + g(x)) = f(x) + 8'(x); and 

3. (f(x)g(x)y = f’a)g(x) + fa’). 


Proof. The first two parts follow immediately from the definition. The third is left 
as Exercise 12.40. 


Definition 12.15. Let F be a field, f(x) € F[x] anda ¢€ F. We say that a isa 
multiple root of f(x) if (x — ay FG). 


Example 12.33. In Q[x], 2 is a multiple root of x? — 4x4 + 7x3 — 7x? — 8x + 20, 
since the polynomial factors as (x — 2)? (x3 + 3x +5). 


Theorem 12.14. Let F be a field, f(x) € F[x] and leta € F bea root of f(x). 
Then a is a multiple root of f (x) if and only if f'(a) = 0. 
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Proof. Suppose that a is a multiple root of f(x), say f(x) = (x — a)*g(x), with 
g(x) € F[x]. Then by Lemma 12.5, f’(x) = 2(x — a)g(x) + (x — a)*g'(x). 
Thus, f’(a) = 0. Conversely, suppose that f’(a) = 0. By Theorem 11.2, f(x) = 
(x—a)h(x), forsomeh(x) € F[x].Thus, f’(x) = h(x)+(x—a)h' (x). As f'(a) = 0, 
we have 0 = h(a) + (a — a)h'(a) = h(a). By Theorem 11.2, (x — a)|h(x), and 
hence (x — ay*| $x). 


Corollary 12.3. Let F be a field and let f(x) € F[x] be irreducible. Let K be a 
splitting field of f (x) over F. If f (x) has a multiple root in K, then f'(x) is the zero 
polynomial. 


Proof. Let a be the multiple root. Then (multiplying f(x) by a suitable element of 
F to make it monic), we see that f(x) is the minimal polynomial of a over F. By 
Theorem 12.10, f(x)| f(x). But if f’(x) 4 0, then deg(f’(x)) < deg(f(x)), which 
is impossible. Therefore, f’(x) is the zero polynomial. 


Definition 12.16. A field F is said to be perfect if no irreducible f(x) € F[x] has 
multiple roots in any splitting field of f(x) over F. 


We digress from our discussion of finite fields to mention the following. 
Theorem 12.15. Every field of characteristic zero is perfect. 


Proof. If f(x) = do +--+ + a,x", witha, #4 Oandn > 1, then f’(x) = a, +---+ 
na,x"—| has leading coefficient na, 4 0. Thus, f’(x) is not the zero polynomial. 
Apply Corollary 12.3. 


Actually, finite fields are perfect too! Let us see why. 


Lemma 12.6. Let F be a finite field of characteristic p. Then the function a : F > 
F given by a(a) = a? is an automorphism. 


Proof. Since F is commutative, a(ab) = (ab)? = a?b? = a(a)a(b), for alla, b € 
F.. By Theorem 8.14, a(a+ b) = (a+b)? =a? +b? =a(a)+a(b). Ifa? = 0, 
then since F is a field, a = 0. Thus, @ is one-to-one. Since F is finite, a is onto as 
well. 


Theorem 12.16. Every finite field is perfect. 


Proof. Suppose that F has characteristic p. Let f(x) € F[x] be irreducible. Suppose 
that f(x) = ag + ayx +---+ a,x". If f(x) has multiple roots in a splitting field, 
then by Corollary 12.3, f’(x) = 0. Thus, ka, = 0, forl < k <n. If p { k, 
then as (k, p) = 1, we may write ku + pv = 1, for some u,v € Z. Therefore, 
a, = uka, + pva, = 0+ 0 = 0. Thus, 


f(x) =dao+ Apx? + 2px? free t Ampx""?. 


In view of the preceding lemma, there exist b; € F such that bP = jp. But now 
Theorem 8.14 tells us that 
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(by + dix + byx? +--+ Dy x™)? = bE + Px? +--+ bP x 
= do + Apx? +... + Ampx""? 


= f(x). 


That is, f(x) is reducible. This contradiction completes the proof. 


What would an imperfect field look like? Clearly, it would have to be an infinite 
field of prime characteristic. Exercise 12.44 shows how to construct an imperfect 
field. 

Back to the finite fields! 


Lemma 12.7. Let F be a field of prime characteristic p and let n be a positive 
integer. If K = {a € F : a” =a}, then K is a subfield of F. 


Proof. See Exercise 8.40. 


Theorem 12.17. Let p be a prime and n a positive integer. Then a field F has order 
p" ifand only if it is a splitting field of x?" —x over the prime subfield, (an isomorphic 
copy of) Zp. 


Proof. Let F have order p”. Then U(F) has order p” — 1. Thus, if0 # a € F, then 
a?’—! = 1, and hence a?’ —a = 0. Clearly, 0?" —0 = Os well. Thus, every element 
of F is a root of x?" — x. By Corollary 11.3, x?" — x can only have p” roots. Thus, 
x?" — x splits over F, and surely it cannot split over any smaller field, as all of the 
roots must be present. Therefore, F is a splitting field of x?" — x over Z Bs 
Conversely, let F be a splitting field of x?" — x over Z p- By Lemma 12.7, the 
roots of x?" — x forma subfield K of F. Since x?" — x splits over K, we must have 
F = K. Furthermore, the formal derivative of x?" — x is —1, which has no roots. 
Therefore, by Theorem 12.14, x?" — x has no multiple roots. In particular, | F| = p”, 
as required. 


Theorem 12.18. [fk is a positive integer, then there is a field of order k if and only if 
k = p" for some prime p and positive integer n. All fields of order p” are isomorphic. 


Proof. By Theorem 12.7, a finite field must have order p”. Theorem 12.12 tells us 
that x?" — x has a splitting field over Z p- By Theorem 12.17, this splitting field has 
order p”. But Theorem 12.17 also says that every field of order p” is such a splitting 
field. By Corollary 12.2, these splitting fields are isomorphic. 


The unique (up to isomorphism) field of order p” is called the Galois field of 
order p”. 

We can also determine the subfields of a finite field. In order to do so, we will 
need the following theorem, which is of interest on its own. 


Theorem 12.19. Let F be a field. Then any finite subgroup G of U(F) is cyclic. 
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Proof. Since G isa finite abelian group, Theorem 5.3 tells us that it is a direct product 
of cyclic groups. If all of these cyclic groups have relatively prime orders, then by 
Theorem 5.4, G is cyclic, and we are done. Otherwise, we may assume that G has 
a subgroup (a) x (b), and there exists a prime p dividing the orders of a and b. By 
Cauchy’s theorem, (a) and (b) each contain an element of order p. Thus, G has a 
subgroup isomorphic to Z, x Z)p. But every element of Z, x Z, has order 1 or p. 
That is, we have at least p* roots for the polynomial x” — 1 € F[x], which has degree 
P, giving us a contradiction and completing the proof. 


Theorem 12.20. Let F bea field of order p", for some prime p and positive integer n. 
Then every subfield of F has order p" ,, for some positive divisor m of n. Furthermore, 
for each positive divisor m of n, F has exactly one subfield of order p™, namely 
{a € F:a?” =a}. 


Proof. Let K be a subfield of F. Then K and F have the same prime subfield, (an 
isomorphic copy of) Z,. By Theorems 12.7 and 12.8, 


n=([F:Z,)=[F: K][K : Z)]. 

In particular, if [K : Z,] =m, then |K| = p” and m|n. 

Let m be a divisor of n and let K = {a € F : a?” = a}. By Lemma 12.7, K is 

a subfield of F’. Furthermore, the preceding theorem tells us that U(F) is cyclic of 
order p” — 1. In addition, 


p" =f (p™ _ Dd Ae p a po 4 pe dhs ooo gy, 


Thus, (p” — 1)|(p” — 1). By Corollary 3.3, U(F) has a subgroup G of order p” — 1. 
But every element a of G satisfies a?"—! = 1, and hence a?” =a. ThatisG C K. 
Also, 0 € K, and therefore K has at least p” elements. But every element of K is a 
root of x?” — x, and therefore K can have at most p” elements. 

To prove the uniqueness of this subfield, suppose that L is another subfield of 
F with p” elements. Then U(K) and U(L) are both subgroups of order p” — 1 
in U(F). However, Corollary 3.3 tells us that U(F) has only one such subgroup. 
Therefore, U(K) = U(L). As the unit group of a field consists of everything except 
0, we have K = L, as required. 


Exercises 


12.39. Find the smallest field containing exactly 3 proper subfields. 


12.40. Let F be a field and f(x), g(x) € F[x]. Show that (f(x)g(x))’ = 
f'(x)g(x) + f (x)g’ (x). 


12.41. Let f(x) € Zs[x] be an irreducible polynomial of degree 3. If K is a splitting 
field of f(x) over Zs, show that |K| = 53 or 5°. 


12.42. Let K bea field of order p” for some prime p and positive integer n, having 
subfields F and L of orders p” and p’, respectively. Find the order of FM L. 
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12.43. Let F bea fieldand f(x) € F[x] an irreducible polynomial having a multiple 
root in some extension field of F. Show that char F = p for some prime p, f(x) = 
dy + apx? + Ay px? + +++ + Ampx"? for some a; € F, and that at least one of the a; 
is transcendental over the prime subfield of F. 


12.44, Let Z,[t] be a polynomial ring over Z, and F = Z)(t) its field of fractions. 
Show that the polynomial x? — t € F[x] is irreducible over F, but that it has a 
multiple root in some extension field of F. In particular, conclude that F is not a 
perfect field. 


12.45. Theorem 12.19 tells us that the unit group of a finite field is cyclic. If char 
F 42, show that the unit group of an infinite field is not cyclic. 


12.46. Suppose char F' = 2. Let us prove that the preceding exercise still holds. 
Suppose, to the contrary, that U(F’) is cyclic. Let U(F) = (a). 


1. Show that F = Z>(a). 

2. Ifa is algebraic over Z2, show that F is finite, and we are done. 

3. If a is transcendental over Z2, show that there exists an integer n such that 
a" =a +1, and obtain a contradiction. 


12.47. Suppose we wrote x!*> — x as a product of irreducibles over Zs. Show that 
each of these irreducible polynomials has degree 1 or 3. (Please do not actually write 
the polynomials!) 


12.48. Show that for every prime p and positive integer n, there exists an irreducible 
polynomial of degree n in Z,[x]. 
Reference 
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Chapter 13 M®) 
Public Key Cryptography a 


In this short chapter, we talk a bit about cryptography. First, we discuss some classical 
sorts of private key methods, and their limitations in the modern world. We then look 
at the first public key cryptographic method. 


13.1 Private Key Cryptography 


Countless methods of encrypting messages have been invented over the centuries, and 
we will not attempt to give an exhaustive list here. Let us discuss a few well-known 
codes. 

One ancient method is known as the Caesar cipher. It could not be much simpler! 
Each letter in the alphabet is shifted forward three letters. Thus, A becomes D, B 
becomes E, and Z becomes C. If we wish to send the message HOWDY! then 
our encrypted message is KRZGB. Decrypting the message is equally simple; the 
recipient shifts each letter back three positions. 

This is not a particularly good code. An opponent who knew that we were using 
a Caesar cipher could read any intercepted message instantly. We could complicate 
things a bit by selecting a positive integer k as a key. Instead of shifting letters 3 
positions ahead, we would shift them k positions ahead. Decryption would then be 
a matter of shifting back k positions. We call this an additive cipher. This is better, 
but not much. There are only 26 possible keys (really 25, as one of them will just 
leave the message unencrypted). It would not take an opponent long to try all of the 
possible keys on an intercepted encrypted message, and see which one gives sensible 
text. 

But we can be more sophisticated than that. Let p be any permutation of the set 
of letters of the alphabet. We can then encrypt text by replacing each letter with its 


'The author acknowledges that the circumstances under which this message would need to be sent 
secretly are few and far between. 
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image under p. This is called a simple substitution cipher. For instance, suppose 
we use Table 13.1. 


Table 13.1 Encryption table for a simple substitution cipher 


originaltext |ABCDEFGHIJKLMNOPQRS TUVWXYZ 
encrypted textt|R VC XNOAYWBKUJETDILQMZFS PGH 


Encrypting HOWDY, we obtain YTSXG. To decrypt, we apply the inverse of p. 
To put this another way, we flip the rows of the table and, for the sake of convenience, 
sort by the encrypted letter rather than the original letter, as in Table 13.2. 


Table 13.2 Decryption table for the simple substitution cipher from Table 13.1 


encrypted text|A BCDEFGHIJKLMNOPQRS TUVWXYZ 
originaltext |G JCPNVYZQMKRTEFXSAWOLBIDHU 


Thus, YTSXG decrypts to HOWDY. This is a vast improvement over the additive 
cipher in terms of security, because the number of possible keys is 26!. Even for a 
computer, that is a huge number of permutations to consider. It does come at the cost 
of having a larger key to exchange. Also, if a substantial amount of text is intercepted, 
this cipher is vulnerable to frequency analysis. That is, in English text, some letters 
occur much more frequently than others. For instance, E is by far the most common 
letter, T is the second most common, and so forth. An opponent could look for the 
most common letters in our text and make an educated guess that those represent 
T and E. If another moderately common letter occurs between them frequently, it 
might just be THE. Proceeding in this way, the code could be cracked. 

Can anything be done about this? There is always the one time pad. This is, in 
fact, an unbreakable cipher. It is also quite simple. The key is a string of random 
letters, at least as long as our message to be encrypted. We then assign a numerical 
value to each letter. We let A be 0, B be 1, and so on, letting Z be 25. To encrypt, 
we add the value of the first letter of our message to the value of the first letter of 
our key in Z6. We then do the same with the second letter of our message and the 
second letter of our key, until we reach the end of our message. Each of our sums is 
then converted back to a letter. 

For instance, say we wanted to encrypt HOWDY, and our randomly selected key 
was NCVBT. Now, H is 7 and N is 13, so the sum is 20, which is U. Similarly, O 
and C are 14 and 2 respectively, giving a sum of 16, which is Q. Next, W is 22 and 
V is 21, giving a sum of 17, which is R. Now, D and B give 3 + | = 4, and hence E, 
and Y and T give 24 + 19 = 17, which is R. Thus, our encrypted text is UQRER. 
To decrypt, we subtract the value of the key letter from the corresponding encrypted 
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letter value. For this message, 20 — 13 = 7, 16-2 = 14,17 -—21 = 22,4-1=3 
and 17 — 19 = 24, and we obtain HOWDY. 

Assuming that the key is truly random, is only used once and is kept secret, an 
opponent who intercepts an encrypted message will be unable to determine anything 
more than the length of the message. The difficulty with this cipher is that the partic- 
ipants must have the ability to exchange a very large key secretly. In general, anyone 
who can do that may not need a code! For certain purposes, though, it is ideal. For 
example, if two people are able to meet once, exchange a briefcase full of random 
letters, and then leave for distant cities, they will be able to exchange messages while 
they are apart. 

In the internet age, the problem is that most encrypted messages are sent between 
two distant computers, and the computers can never meet secretly to exchange infor- 
mation. None of the schemes we have discussed are suitable. These are all private 
key methods. That is, the key used must be kept secret. Any opponent who discovered 
it could easily decrypt an intercepted message, since the ability to encrypt implies the 
ability to decrypt. Modern codes use public key schemes. The key can be released 
to an opponent without fear, because in these methods, it is quite possible to be able 
to encrypt and not be able to decrypt. 

The next section is devoted to a discussion of the first such scheme. 


Exercises 


Spaces and punctuation have been deleted from all messages to be encrypted or 
decrypted. 


13.1. Encrypt the following message using a Caesar cipher: 
THETREASUREISBURIEDTWENTY PACESNORTHOFTHEPALMTREE 


13.2. A message in English has been encrypted using an additive cipher. Decrypt it. 
BPMBQUMPIAKWUMBPMEITZCAAIQLBWBITSWNUIVGBPQVOA 


13.3. Let us define a multiplicative cipher as follows. Assign the letters of the 
alphabet numerical values as usual (A is 0, B is 1, Z is 25), and choose a positive 
integer k as a key. Then if a letter with value v appears in the text, encrypt it as kv, 
with the multiplication taking place in Z¢. Which values of k will produce a valid 
cipher? 


13.4. A message was encrypted using a multiplicative cipher, as in the preceding 
problem, with k = 7. Decrypt it. 

WUGCDEGCWERCHCZECRC VAWGANMAWWE 

FEGBUW WEHZCDXENQWHCJUPCHPCASJAWD 


13.5. We establish a simple substitution cipher using the following table. 


original text AABCDEFGH IJKLMNOPQRSTUVWXYZ 
encrypted text VY ZXENAWRIOPC SBDFGHJKLMQTU 


Encrypt the following message: 
TRANSFERTENMILLIONDOLLARSONTUESDAY 
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13.6. Using the same simple substitution cipher as in the preceding problem, a 
message is encrypted. Decrypt it. 
YEJJEGJWGEEWBKGHJBBHBBSJWVSVCRSKJEJBBPVJE 


13.7. Making use of the key 
RQPKDFOCMWODKFJDKSKDFVKQUYCHTISOXETX, 

encrypt the following message with a one time pad. 
THEDOCUMENTSAREHIDDENBEHINDTHERENOIR 


13.8. Making use of the key 
ICOWLDIFNSXZIEEOWPAMWRUSDMFJEJFJIBAWUQH, 

the following message was encrypted with a one time pad. Decrypt it. 
EJSJLQOWLULTVXXCBDUDSY YFYQWHEWLAZSSYQY 


13.2 The RSA Scheme 


The RSA Scheme is a public key cipher first described by Ronald L. Rivest, Adi 
Shamir and Leonard N. Adleman in 1977. In fact, an equivalent system was created 
by Clifford C. Cocks in 1973, but his work was classified and not made public for 
more than two decades. 

For convenience, let us say that Bob will be sending messages to June. In this 
case, it is June who creates the cipher. She selects two large distinct primes p and 
q, and lets n = pq. By Theorem 3.19, g(n) = (p — 1)(g — 1). June chooses a 
number e, with 1 < e < y(n), such that (e, p(n)) = 1. The public key consists of 
the numbers e and n. She sends these to Bob, without worrying about whether they 
are intercepted. She does not, however, tell anyone what p and q are! 

Bob prepares to send a message m which must be an integer with O < m < n. 
(We will discuss how to convert text to this format shortly.) Bob calculates 


m’=a_ (mod n), 
where 0 < a < n. He then sends the encrypted message a to June. 
How does June decrypt? The number e was chosen so that e € U(g(n)). Let d be 
the inverse of e in this group. To put that another way, 
de=1 (mod g(n)). 
But now we have the following theorem. 


Theorem 13.1. Let p and q be distinct primes and n = pq. Ifk = 1 (mod ¢g(n)), 
then for any integer b, we have 


bk=b (mod n). 
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Proof. First suppose that (b, n) = 1. Thenb € U(n). But by Theorem 3.17, |U(n)| = 
y(n). Thus, by Corollary 3.5, b?”) = 1 (mod n). Now, g(n)|(k — 1), and hence 
b‘-! =1 (mod n). Thus, b* = b (mod n). 

Now, suppose that exactly one of {p, g} divides b. Without loss of generality, say 
p|b but g { b. Then b € U(q). As |U(q)| = (gq) = g — 1, we see that bY! = 1 
(mod q). Thus, b-DG-) = 1 (mod q) and hence, as above, bk = b (mod q). 
Also, b = b* = 0 (mod p). Thus, p and q both divide b* — b. Since p and q are 
relatively prime, Corollary 2.3 tells us that n = pq|(b* — b), as required. 

Finally, if both p and q divide b, then b = b‘ = 0 (mod n). 


Therefore, to decrypt, June calculates 


alt= (m°)* =m _ (mod n). 

The scheme works because June is the only one who knows d. In order to calculate 
d, an opponent would need to find y(n). But knowing that means being able to 
calculate p and q (see Exercise 13.10). And it is precisely upon the difficulty of this 
problem that the security of the system rests. To be sure, if Bob and June were foolish 
enough to use n = 143, an opponent would be able to find p and q instantly. But 
what if n had 300 digits? Factoring that into two primes of roughly 150 digits each is 
certainly beyond human abilities, and even for a computer, it is going to take a very 
long time. In theory, the cipher would be breakable. But any system that will take a 
fast computer a trillion years to crack is good enough for most purposes. 

How do we create our messages? Suppose that has d digits. Then we will create 
a message m that is at most d — | digits long, so that m < n. If d — 1 is even, we 
will do this by grouping our message into blocks of i letters; if it is odd, then the 
blocks will have size ?. Use the same values for letters introduced in the previous 
section, but make each letter have two digits. Thus, A is 00, B is 01, and Z is 25. 
Put the numbers from one block together to form a message m. (We have to use 
the two-digit method. If we dropped the leading zeroes, we would not know if 123 
meant 1, 2 and 3 (BCD), 1 and 23 (BX) or 12 and 3 (MD).) If the length of our text 
message does not split evenly into blocks of the appropriate length, then pad out the 
last block with random letters. 


Example 13.1. June decides to create an RSA scheme using the primes p = 113 
and g = 137. (Yes, these are much too small to produce a secure system. However, 
the author is far too lazy to perform calculations using 300-digit numbers, and these 
will suffice for an illustration.) Then n = pg = 15481 and y(n) = (p— 1) (q—-1) = 
15232. How can June find a suitable e? A prime larger than both p and q will certainly 
be relatively prime to y(n). June selects e = 151. She then sends the values of n and 
e to Bob. 

As n has five digits, Bob knows that he must break his message into two-letter 
blocks. Suppose he wishes to send the message HOWDY. As the length is not a 
multiple of 2, he pads it out by adding a Q to the end. Now, HO is 0714, WD is 2203 
and YQ is 2416. To encrypt the first message, Bob calculates 
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71415! = 14628 (mod 15481). 


Next, he calculates 
2203'°! = 2494 (mod 15481) 


and 
2416! = 8498 (mod 15481). 


Bob sends June the messages 14628, 2494 and 8498. 

June must calculate d. Since (e, g(n)) = 1, we know that there exist u,v € Z 
such that eu + y(n)v = 1. Thend will be u (modulo g(n)). The Euclidean algorithm 
shows us how to calculate d, and we find that d = 807. Thus, June calculates 


14628°°7 = 714 (mod 15481), 
249487 = 2203 (mod 15481) 


and 
84988" = 2416 (mod 15481). 


The original message was, therefore, 071422032416, which converts to HOWDYQ. 


We should mention a couple of practical points. First, as any power of 0 is 0 and 
any power of | is 1, the messages 0 and 1 will not be encrypted using any RSA 
scheme. For that matter, since e will always be odd, n — 1 (which is —1 modulo 
n), will not change either. Given our method of encrypting English text, n — 1 will 
not arise, but 0 and | might. Can we do anything about it? Keep in mind that in 
the preceding example, we had n = 15481 but the possible messages would have 
been in the range of 0 to 2525. Would there be any harm in pushing them into the 
range of 2 to 2527? Surely not! Thus, we can agree to add 2 to every message before 
encrypting, and then subtract 2 after decrypting. (Do not do this in the exercises!) 

Another point worth mentioning is that e should be reasonably large. To see why, 
note that in the example above, we could have used e = 3. This would be a problem 
if we sent a relatively small message. For example, if we had m = 6, the encrypted 
message would be 6? = 216. No reduction modulo n takes place! An opponent 
who intercepted the message could simply take the cube root of 216 and recover 
the original message, without knowing anything about p and q. If e is large, we can 
ensure that this is avoided. 

While modern ciphers are more complex than the RSA scheme, their security 
invariably rests upon the fact that it is very difficult to factor large numbers. 


Exercises 


Spaces have been deleted from all messages to be encrypted or decrypted. Where a 
letter is needed to pad out a block, use Q. 
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13.9. Suppose that someone foolishly used numbers as small as n = 1961 and 
e = 43 to create an RSA scheme. Crack the code by determining d. 


13.10. Ifn is a product of two distinct primes, and both n and g(n) are known, show 
how to determine the two primes quickly. Illustrate the method using n = 10961, 
y(n) = 10752. 


13.11. Encrypt the message ALGEBRA using an RSA scheme with n = 17399 and 
e = 149. 


13.12. Encrypt the message ABELIANGROUP using an RSA scheme with n = 
18203 and e = 191. 


13.13. Having set up an RSA scheme using p = 103, gq = 179 and e = 151, we 
receive the following message: 2469, 7093, 14773, 10900, 143. Decrypt it. 


13.14. Having set up an RSA scheme using p = 89, g = 167 and e = 181, we 
receive the following message: 13962, 8768, 7864, 4297, 12341. Decrypt it. 


Chapter 14 ®) 
Straightedge and Compass Constructions =” 


We now apply our knowledge of field extensions in order to answer three questions 
posed by the ancient Greeks. 


14.1 Three Ancient Problems 


More than 2000 years ago, the ancient Greeks performed many geometric construc- 
tions using a straightedge and compass. For our purposes, a straightedge is an 
infinitely long ruler having no markings on it. If we have constructed two points, 
then we can use the straightedge to construct the line passing through those points. 
Furthermore, if we have constructed two points A and B, then for any point C we 
have constructed (which may or may not be distinct from A and B), we can use the 
compass to draw a circle centred at C with radius equal to the distance between A 
and B. 

Next, we can take any two lines, any two circles, or one of each, that we have 
constructed, and construct their points of intersection. Then we repeat! The general 
question is, what can we construct in finitely many steps? 

Let us discuss a few simple examples that will be of use. 


Example 14.1. If we have constructed points A and B, let us construct a perpendic- 
ular bisector to the line segment A B. To this end, construct a circle centred at A with 
radius AB, and a circle centred at B with radius AB. Call the intersection points of 
these circles C and D. Then construct the line through C and D. It is a perpendicular 
bisector of AB, as illustrated in Figure 14.1. 


Example 14.2. Suppose that we have constructed points A and B, and the line pass- 
ing through them. Let us say that we have constructed point C as well, although we 
do not insist that C ¢ {A, B}. We claim that we can construct a line through C that 
is perpendicular to the line through A and B. Without loss of generality, we may 
assume that C and A are distinct points. Construct the circle centred at C with radius 
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Fig. 14.1 Construction of a 
perpendicular bisector of AB 


AC. If it intersects the line through A and B at a single point (which must necessarily 
be A), then the line through A and C will suffice, as illustrated in Figure 14.2. 

Otherwise, suppose that the circle meets the line at points A and D. Then the line 
we are looking for is the perpendicular bisector of A D, which the preceding example 
allows us to construct. See Figure 14.3. 


Example 14.3. Suppose that we have constructed three points A, B and C, that are 
not collinear. The three points must lie on a circle. Let us construct the centre of the 
circle, and hence the circle itself. Using Example 14.1, construct the perpendicular 
bisector of the chord AB. It must pass through the centre of the circle. Similarly, we 
can construct the perpendicular bisector of BC, and it too passes through the centre 
of the circle. Therefore, the point of intersection D of the two lines we have just 
constructed is the centre of the circle, and we can construct the circle itself, as it is 
centred at D and has radius AD. See Figure 14.4. 


For all the remarkable geometric constructions that were performed in antiquity, 
some problems could not be solved at the time. 


Question 14.1. (Squaring the Circle). Given an arbitrary circle, can we construct a 
square having the same area? 


As we shall see, if we are given a square, we can construct another square whose 
area is twice that of the first square. If we extend our constructions into three dimen- 
sions, we have the following. 


Question 14.2. (Doubling the Cube). Given an arbitrary cube, can we construct 
another cube having twice the volume of the first cube? 


If we are given three distinct points A, B and C, then we can construct a bisector of 
the angle ZABC. That is, we can construct a point D such that ZDBC = 5ZABC. 
See Exercise 14.5. This naturally led to the following question. 
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Fig. 14.2. Construction of a 
perpendicular to AB passing 
through C (first case) 
$C 

A B 
Fig. 14.3 Construction of a 
perpendicular to AB passing 
through C (second case) 

eC 
A D B 


Question 14.3. (Trisecting the Angle). Given any three distinct points A, B and C, 
can we construct a point D such that ZDBC = 4ZABC? 


In fact, all three questions have a negative answer, but it was not until the nineteenth 
century that the tools of modern algebra allowed a proof to be given. 


Exercises 


14.1. Suppose that we have points A and B, and the distance from A to B is 1. 
Construct points C and D such that the distance from C to D is 1.5. 


14.2. Suppose that we have points A and B, and the distance from A to B is 1. 
Construct points C and D such that the distance from C to D is J/2. 


244 14 Straightedge and Compass Constructions 


Fig. 14.4 Construction of 


B 
the circle passing through A, ae | 
BandC i 
// Cc 
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14.3. Given two points A and B, construct a point C such that A BC is an equilateral 
triangle. 


14.4. Given two points A and B, construct points C, D and E all lying on the circle 
centred at A and passing through B, such that BC DE is a square. 


14.5. Given three points A, B and C, construct a point D such that ZDBC = 
5ZABC (where ZA BC is assumed to be at most 180°). 


14.6. Given three points A, B and C, construct a point D such that ZDBC = 
2ZABC (where ZABC is assumed to be at most 90°). 


14.7. Given two points A and B, construct points C and D on the circle centred at 
A and passing through B, such that BC D is an equilateral triangle. 


14.8. Suppose we are given three points A, B and C, not collinear. Construct the 
inscribed circle for the triangle ABC; that is, construct a circle that lies inside the 
triangle but intersects each side at a single point. 


14.2 The Connection to Field Extensions 


In order to tackle these problems, we need to be able to discuss them in algebraic 
terms. Let us formalize our procedure. 

We will begin with two points. (Without those, we cannot construct any lines or 
circles, and so we get nowhere.) Let us identify these with the points (0, 0) and (1, 0) 
in the plane. We let P; = {(0, 0), (1, 0)}. Then we proceed as follows. For every 
positive integer i, take all pairs of distinct points A and B in P;, and draw the line 
through A and B. Also, for every pair of distinct points A and B in P;, and for every 
point C in P; (where C may or may not be in {A, B}), draw the circle centred at C 
with radius equal to the distance between A and B. Let Q; be the set of all lines and 
circles obtained in this way. Then let P;+; be the set of all points of intersection of 
any two distinct lines, any two distinct circles, or any line and any circle in Q;. 

We note that each P; and Q; is a finite set, with P; C P;,; and Q; C Qj4, for 
alli. 
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Definition 14.1. A line or circle in the plane is constructible if it is in some Q;. A 
point in the plane is constructible if it is in some P;. A real number r is constructible 
if the point (7, 0) is constructible. 


Let us start by proving what numbers we can construct, and then see what limits 
there are upon constructibility. 


Lemma 14.1. Letr € R. Then the following are equivalent: 


1. r is constructible; 

2. —r is constructible; 

3. the point (0, r) is constructible; and 
4. the point (0, —r) is constructible. 


Proof. If r = 0, there is nothing to do, so assume that r 4 0. 

Suppose that (1) holds. Then let A = (0,0) and B = (r, 0). We can construct 
the circle centred at A with radius AB and the line through A and B (namely, the 
x-axis). They intersect at C = (—r, 0), giving (2). As in Example 14.1, construct the 
perpendicular bisector of BC, which is the y-axis. The circle we constructed above 
intersects it at (O, 7) and (0, —r), giving (3) and (4). By symmetry, (2) implies (1) as 
well. 

If we assume (3), then again, we can construct a circle centred at (0, 0) with radius 
|r|. As we are given (0, 0) and (1, 0), we can construct the x-axis, which intersects 
the circle at (r, 0), giving (1). By symmetry, (4) implies (1) as well. 


Note from the proof that since (0, 0) and (1, 0) are constructible, so are the x- 
and y-axes. 


Lemma 14.2. Let a,b € R. Then the point (a, b) is constructible if and only if the 
numbers a and b are constructible. 


Proof. Suppose that (a, b) is constructible. As in Example 14.2, construct a line 
through (a, b) perpendicular to the x-axis. It intersects the x-axis at (a, 0), and so a 
is constructible. Then construct the line through (a, b) perpendicular to the y-axis. 
It intersects the y-axis at (0, b). Hence, by the preceding lemma, b is constructible. 

Conversely, let a and b be constructible. Then the points (a, 0) and (0, b) are 
constructible. Construct the line perpendicular to the x-axis through (a, 0). Similarly, 
construct the line perpendicular to the y-axis through (0, b). These two lines meet at 
(a, b). 


Theorem 14.1. The constructible numbers form a subfield of R. 


Proof. By definition, 1 is constructible. Suppose that a and b are constructible. We 
would like to show that a + b and a — b are constructible. If b = 0, there is nothing 
to do. Otherwise, construct the circle centred at (a, 0), the radius of which is the 
distance from (0, 0) to (b, 0). It intersects the x-axis at the points (a + b, 0) and 
(a — b, 0). See Figure 14.5. 
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Fig. 14.5 Construction of y 
a+banda—b 


a-b x 


If b ¥ 0, we also need to construct ab~'. As it is sufficient to construct —ab7!, 
we may assume that a > 0 and b > O. But in view of the preceding lemma, we 
can construct the points (—b, 0), (0, a) and (0, —1). As in Example 14.3, we can 
construct the circle passing through these points. Either geometrically or through 
algebraic manipulation (see Exercise 14.10), we can prove that this circle intersects 
the x-axis at (ab~!, 0). See Figure 14.6. 

Thus, ab“! is constructible. Theorem 8.12 completes the proof. 


As Q is the prime subfield of IR, we now know that every rational number is 
constructible. But we can say more. 


Theorem 14.2. /f a is a positive constructible number, then so is /a. 


Proof. As a and | are constructible, Lemma 14.1 tells us that we can construct 
the points A = (0,a) and B = (0, —1). By Example 14.1, we can construct the 
perpendicular bisector of AB and, hence, its midpoint C = (0, ). Now construct 
the circle with centre C and radius AC. This circle intersects the x-axis at (d, 0) and 
(—d, 0) for some d > 0. Again using Exercise 14.10, we see that d = \/a. Thus, 
,/a is constructible. 


Corollary 14.1. Suppose there exist fields Q = Fo © Fi © -:: © Fy, where 
each F;1, is a quadratic extension of F; and Fy © R. Then every element of Fy, is 
constructible. 


Proof. We noted above that every element of Fo is constructible. Thus, by induction, 
it suffices to show that if every element of a field F is constructible, and[K : F] = 2, 
then every element of K is constructible. Now, if a € K, buta ¢ F, then {1, a} is 
linearly independent over F and hence, in view of Theorem 12.6, a basis for K. In 
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Fig. 14.6 Construction of yn 
ab"! 


particular, K = F(a). By Theorem 12.9, a is algebraic over F' and, in particular, 
Theorem 12.11 tells us that the minimal polynomial has degree 2. Say that it is 
x? + bx +c, with b,c € F. But then a = abi te By the preceding theorem, 
Jb? — 4c is constructible. But now Theorem 14.1 tells us that a is constructible, and 
hence so is every element of F(a) = K, as required. 


Example 14.4. The number J3+ V 2+ 5/15 is constructible. Let Fo = Q, F; = 


Fo(V3), Fo = F\(V15), Fy = Fo(V/2 + 5V15) and Fy = F3(y V2 + 5/15). It is 


clear that each extension is of degree at most 2, since ifa € F, then either /a € F or 
the minimal polynomial of a over F is x7 — a. Furthermore, /3 + V24+ 5/15 € Fy. 


Now, let us try to restrict the sorts of numbers that can be constructed. 


Lemma 14.3. Let F be a subfield of R. Suppose that we have two distinct points A 
and B such that the coordinates of both points lie in F. Then the line through A and 
B has an equation of the form ax + by = c, for some a, b,c € F. If C is any point 
with coordinates in F, then the circle centred at C with radius equal to the distance 
between A and B has an equation of the form (x — d)* + (y — e)* = f, for some 
d,e,feF. 


Proof. Let us say that A = (a),d2), B = (bj, bo) and C = (cj, c2). Then the 
equation of the line is (bz — a)x + (a; —b,) y = a, bz — azb,, and we can see that the 
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coefficients are in F’. Similarly, the equation of the circle is (x — ay + (y- oy = 
(a, — b,)* + (a2 — bo)’, which is of the correct form. O 


Readers familiar with linear algebra will not find the next lemma surprising, as the 
solution to a system of linear equations can be found using only addition, subtraction, 
multiplication and division. 


Lemma 14.4. Let F be a subfield of R. Suppose that we have two lines, ax +by = c 
and dx + ey = f, where a,b,c,d,e, f € F, and that the two lines intersect at a 
single point. Then that point has coordinates in F. 


Proof. If ae = bd, then the lines are parallel (or identical), which is not permit- 


ted. Assume otherwise. Then the point of intersection is (4, af =), and these 
ge— ae— 


coordinates lie in F’. 


Lemma 14.5. Let F be a subfield of R. Suppose that we have a line ax + by =c 
and a circle (x — d)* + (y —e)* = f, witha, b, c,d, e, f € F. If the line and circle 
intersect, then there is a nonnegative number g € F such that the coordinates of the 
intersection point(s) lie in F (\/g). 

Proof. As a and b cannot both be 0, without loss of generality, say a 4 0. Then 


c— 


a2 Substituting into the equation of the circle, we obtain 


2 
(<< -a) +(y-eP af. 


a 


x= 


Simplifying, we obtain an equation of the form uy”? + vy + w = 0, for some 
u,v, w € F. Furthermore, u = a > 0. Then y = svt uw Tey? 4uw <0, 
then the line and circle do not intersect, contradicting our assumption. Thus, let 
g =v’ — 4uw > 0. Then y € F(/g), and as x = aay we see that x € F(,/g) as 
well. 


Lemma 14.6. Let F be a subfield of . Suppose that we have two distinct circles 
(x — a)? + (y — bY = c and (x — d)* + (y —e)* = f, witha,b,c,d,e, f € F. If 
these circles intersect, then there is a nonnegative g € F such that the coordinates 
of the intersection point(s) lie in F(./g). 


Proof. Subtracting one equation from the other, we obtain 
Ca Wdx4+ Cb= 2a fact CSP Hw ae. 


This is the equation of a line unless 2a — 2d = 2b — 2e = 0. But in the latter case, 
the circles have the same centre, meaning that they are identical or do not intersect, 
so We may assume otherwise. Thus, we are now looking at the intersection of a circle 
and a line (with coefficients in F’), and Lemma 14.5 applies. 


Time to put it all together! 
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Theorem 14.3. A real number a is constructible if and only if there exist subfields F; 
of R such that Q = Fo © F, © +++ © Fy, where each F;+, is a quadratic extension 
of F; anda € Fy. 


Proof. One direction of the theorem is given by Corollary 14.1. Let us prove the other. 
Suppose that a is constructible. Referring to the sets P; and Q; from the definition 
of constructibility, let K; be the intersection of all subfields of R containing all of 
the coordinates of the points in P;. Then each K;+; is an extension field of K;. 

We claim that for each i, there exist fields F; withhQ= fy CF, C---C Fy = 
K;, where each F;, is a quadratic extension of F’;. Our proof is by induction on i. 
If i = 1, then P; = {(0, 0), (1, 0)} and hence K; = Q and there is nothing to do. 
Assume that our claim holds for i. Then we have Q = Fo C Fi C++: C Fn = Kj, 
where each Fj; is a quadratic extension of F';. Now, by Lemma 14.3, every line and 
circle in Q; has coefficients in K;. Furthermore, Lemmas 14.4, 14.5 and 14.6 tell us 
that every possible intersection point of two distinct lines, two distinct circles or a line 
and a circle in P, has coordinates either in K; or in K;(b), for some nonnegative 
b € K;. But P; is a finite set, so there are only finitely many values of b being 
used. Let them be b,,...,b,. For each k, 1 < k <n, let Fngk = Finyn—1(J/ bx). 
If Jb, € Fn+e—1, then Fite = Fn+e-1, and we can discard F,,,,. Otherwise, 
x? — b; is irreducible over Fin+k—1, and So Fy,44 is a quadratic extension of Fin44-1, 
by Theorem 12.11. But the last of the F,,,1, is, by definition, K;,,, establishing the 
claim. 

Since a is constructible, (a, 0) € P;, for some i, hence a € K; and the proof is 
complete. 


Corollary 14.2. Leta € R. Ifa is constructible, then a is algebraic over Q and, in 
fact, its minimal polynomial over Q has degree 2” for some nonnegative integer m. 


Proof. Using F; as in the statement of the preceding theorem, we have a € Fy, and 
by Theorem 12.8, 


[Fe : Q)= Ue: FeaallFeaa: Fal: [Fi : Q) = 2". 


Thus, by Lemma 12.1, the numbers 1, a, a’,..., a are linearly dependent over Q. 


That is, a satisfies a nonzero polynomial over Q. By Theorem 12.11, [Q(a) : Q] < co 
and therefore, by Theorem 12.8, 


* =[h: Q=[% : Q@MIQ@) : QI). 


That is, [Q(a) : Q] divides 2‘, and so it is 2”, for some m. By Theorem 12.11, the 
degree of the minimal polynomial of a over Q is 2”. 


Please note that while the condition given in Theorem 14.3 is necessary and 
sufficient for a to be constructible, the condition given in the corollary is not. It is 
possible to find a real number a whose minimal polynomial over Q has degree 4, but 
such that a is not constructible. 
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Exercises 


14.9. Let a and b be nonzero real numbers. If a is constructible and b is not, show 
that neither a + b nor ab is constructible. If c is not constructible, show by example 
that b + c may or may not be constructible. 


14.10. Leta, b,c andd be positive real numbers and suppose that a circle in the plane 
passes through the points (a, 0), (—b, 0), (0, c) and (0, —d). Show that ab = cd. 


14.11. Are the following numbers constructible? 


1 2a H/o 
2. 43473 


14.12. Is ./2 + /4 constructible? 


14.13. Let a be a real root of the polynomial x° — 15x° + 27x+ — 12x3 + 30x? — 
21x + 87. Is a constructible? 


14.14. Let a bea real root of the polynomial x° — 6x*+ 12x? —8. Is a constructible? 


14.3 Proof of the Impossibility of the Problems 


We now have the machinery necessary to answer the three questions from Section 
14.1. First, let us look at squaring the circle. We may as well assume that our two 
initial points are the centre of the circle, (0, 0), and a point on the circle, (1, 0). Thus, 
we can construct the unit circle immediately. Its area is 2. If we were to construct a 
square with area 7, we would need to construct an edge of length ./7. The following 
theorem tells us that we cannot. 


Theorem 14.4. The number ./x is not constructible. 


Proof. If ./m were constructible, then by Theorem 14.1, x would be constructible 
as well. But as we mentioned in Section 12.3, a is transcendental over Q. This 
contradicts Corollary 14.2. 


As we discussed in Section 14.1, doubling a square is possible. Indeed, if we can 
construct a side with length s, then by Theorems 14.1 and 14.2, the number sV/2 is 
also constructible, and this will be the side length of a square with twice the area. 
To deal with the problem of doubling the cube, without worrying about going into 
the third dimension, we may simply suppose that one edge extends between our two 
initial points, and thus has length 1. This would lead to a cube with volume 1. To 
obtain a cube with volume 2, we would need an edge with length V2. This is not 
going to happen. 


Theorem 14.5. The number \/2 is not constructible. 
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Proof. By Example 12.27, the minimal polynomial of »/2 over Q is x* — 2. But then 
Corollary 14.2 tells us that «/2 is not constructible. 


Finally, what about trisecting an angle? Some angles can be trisected (see Exercise 
14.15). But not all. Indeed, we will show that an angle of 60° (or 3 Z, as we will be 
doing a bit of trigonometry) cannot be trisected. In view of ‘Theorems 14.1 and 14.2, 


the numbers 0, 1, 5 and = are all constructible. Thus, by Lemma 14.2, we can 


construct the points A = (. 8), B = (0,0) and C = (1,0). Then ZABC = 3. 
Thus, we do not need to assume anything extra to obtain the angle. If we could find 
a point D such that 7DBC = 7, then we could draw the line through D and B, 
and then intersect with the unit circle centred at B. An intersection point would be 
(cos (Z) , sin (Z)). By Lemma 14.2, this would require cos (3) to be constructible. 
However, the followink theorem dashes any hopes of that. 


Theorem 14.6. The number cos (% 


5 ) is not constructible. 


Proof. For any 0 € R, note that 


cos(3@) = cos(20) cos(@) — sin(20) sin(@) 
= (cos*(@) — sin?(@)) cos(@) — 2 sin*(@) cos(@) 
= cos*(9) — 3 sin’(@) cos(@) 
= cos*(@) — 3(1 — cos*(@)) cos(@) 
= 4cos*(8) — 3cos(6). 


Let 6 = 5. Then cos(36) = 5, and so 
1 3 
5 = 4cos’ (8) — 3 cos(@). 


That is, cos (3) satisfies the polynomial 8x* — 6x — 1. If this polynomial were 
reducible over Q, then by Corollary 11.2, it would have a rational root. By Theorem 


11.3, the only possible roots are +1, 5, i, : But none of these work. Thus, 
8x? — 6x — 1 is ee over Q. Therefore, the minimal polynomial of cos (4) 


over Q is x? — ix =o +. Corollary 14.2 completes the proof. 


Exercises 


All angles are expressed in radians. 
14.15. Show that it is possible to trisect a right angle using straightedge and compass. 


14.16. Show that the angles 2/6 and 27/3 cannot be trisected using straightedge 
and compass. 
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14.17. In the next two problems, we will show that it is impossible to construct an 
angle of 9 = 27/7. If it were possible, then as we are given the points (0, 0) and 
(1, 0), we would also be able to construct (cos(@), sin(@)). In particular, the number 
cos(@) would be constructible. Let us show that it is not. To this end, for each n, 
1 <n < 6, express cos(n@) as a linear combination over Q of cos‘ (@),0<k <3. 


14.18. Let 6 = 2s/7. 


1. Show that cos(@) + sin(@)i is acomplex root of 1+ x + + x8 tutta? + x8, 

2. Show that 1 + cos(@) + cos(2@) + cos(3@) + cos(40) + cos(5@) + cos(60) = 0. 

3. Use the answer to the preceding exercise to show that cos(@) is a root of 8x? + 
4x? — 4x — 1. 

4. Conclude that cos(@) is not constructible. 


14.19. Suppose we are given the vertices A = (0,0), B = (1,0) and C = (cj, c2) 
of a triangle. Show that we can construct points D, E and F such that the triangles 
ABC and DEF are similar, but DE F has twice the area of ABC. 


14.20. Suppose that we are forced to perform our constructions using a straightedge 
and a collapsing compass. That is, any time we lift the compass, it collapses. In 
particular, we cannot directly use it to construct a circle centred at A with radius 
equal to the distance between B and C. All we can do is take two points A and B 
that we have constructed, and construct a circle centred at A and passing through 
B. Show that this does not change the set of constructible numbers in any way. That 
is, show that any number that was constructible before is still constructible using a 
straightedge and collapsing compass. 


Appendix A 
The Complex Numbers 


The complex numbers are an extension of the real numbers. 


Definition A.1. A complex number is a formal expression a + bi, witha, b € R. 
The set of all complex numbers is denoted C. 


We define addition and multiplication on C via 
(a+ bi)+(c+di)=(a+c)+(b+d)i 


and 
(a + bi)(c + di) = (ac — bd) + (ad + bc)i 


for alla, b,c,d ER. 
Example A.1. Observe that (2+3i)+(5—9i) = 7—6i and (2+3i)(5—9i) = 37—3i. 


We identify the real number a with the complex number a+0i. A complex number 
0 + bi, with b € R, is said to be purely imaginary. We simply write bi for such a 
number. In particular, note that i2 = —1. Also, if u = a + bi, write —u = —a — bi. 
Let us summarize a few properties concerning complex addition. 


Theorem A.1. Let u,v,w € C. Then 


u+veC; 

ut+tv=v+u; 
(u+tv)+w=ut+(v4+w); 
u+0O=u; and 

u+ (—u) = 0. 


MRWNDS 


Proof. The calculations are all straightforward. For instance, to show (2), we note 
that 


(a+bi)+(c+di) = (atc)+(b+d)i = (c+a)+(d+b)i = (c+di)+(a+bi). 


The remaining parts are left to the reader. 
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Similarly, we can list some properties of complex multiplication. 
Theorem A.2. Letu,v,w € C. Then 


uveC; 

uv = vu; 

(uv)w = u(vw); 

u(v+w)=uv+uw; 

lu =u; and 

ifu #4 0, then there exists az € C such that uz = 1. 


AwKRWN> 


Proof. Again, all of the calculations in (2) through (5) are straightforward. For 
instance, to prove (3), lett u = a+ bi,v = c+di andw = e+ fi, with 
a,b,c,d,e, f € R. Then 


(uv)w = ((ac — bd) + (ad + bc)i)(e + fi) 
= (ace — bde — adf — bcf) + (acf — bdf + ade + bce)i 
= (a+ bi)((ce — df) + (cf + de)i) 
= u(vw). 


(6) If u = a + bi, then let z = 


a b_: 
a+e abe! 
(Readers who have finished Chapter 3 will realize that Theorem A.1 shows that 
C is an abelian group under addition. Those who have completed Chapter 8 will 
understand that the two theorems combined show that C is a field.) 
Let us discuss a simple example of a way in which the complex numbers differ 
from the real numbers. 


Definition A.2. If z € C and7n is a positive integer, then we say that z is a primitive 
nth root of unity if z” = 1 but z’” # 1 for any positive integer m <n. 


In R, the only roots of unity are | and —1. But we see immediately that in C, we 
have a primitive fourth root of unity, 7. We can say more, however. We will need this 
well-known theorem due to Abraham de Moivre. 


Theorem A.3 (De Moivre’s Theorem). Let 0 € R. Then 
(cos(@) + sin(O@)i)” = cos(n@) + sin(né)i, 


for any positive integer n. 


Proof. We proceed by induction on n. If n = 1, there is nothing to do. Assume that 
the theorem holds for n. Then 


(cos(@) + sin(@)i)"*! = (cos(n@) + sin(n@)i)(cos(@) + sin(6)i) 
= (cos(n@) cos(@) — sin(n@) sin(@)) 
+ (cos(n@) sin(@) + sin(n@) cos(@))i 
= cos((n + 1)8) + sin((n + 1)6)i, 
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as required. 


2a 


Corollary A.1. Letn be a positive integer. Then cos (7) + sin (=) i is a primitive 


nth root of unity in C. 


Proof. By Theorem A.3, 


cos | — } +sin{| — ]1 = COs + sin {| —— ]1, 
n n n n 
for any positive integer m. If m = n, then we obtain cos(27r) + sin(27)i = 1. On 


the other hand, if 1 < m <n, then0O < m < 1, and hence cos (72) #1. 


Example A.2. Letting n = 3, we obtain a primitive cube root of unity, namely, 
=) 4 V3; 
2 ri 


Appendix B 
Matrix Algebra 


Let us discuss a few definitions and basic properties of matrices. The entries in the 
matrices will come from rings and fields. Readers who are not yet familiar with these 
terms can simply assume that the entries are real numbers. 


Definition B.1. Let R be aring and m and n positive integers. Then an m x n matrix 
over R is an array of elements of R with m rows and n columns. If our matrix is A, 
then we write a;; for the (i, j)-entry of A; that is, 


411 412 *** Ain 

a21 422 *** G2n 
A= 

GAm1 Am2 °°" Amn 


Example B.1. If we let 
467 
a= (; 8 :) ; 
then A is a2 x 3 matrix over R. Furthermore, a). = 6 and a2; = 3. 


Definition B.2. Let A and B be m x n matrices over aring R. Then their sum A+ B 
is the m x n matrix C such that cj; = aj; + bj; for alli and j. 


Example B.2.. Working with 2 x 2 matrices over R, we have 


36 46\ (712 
25) =a) 
For any m xn matrix A, we also let — A be the m x n matrix B such that bj; = —aj; 
for all i and j. Furthermore, the m x n zero matrix has every entry 0. We denote 


this matrix by 0. 
Let us list a few properties of matrix addition. 
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Theorem B.1. Let R be a ring, and m and n positive integers. If A, B and C are 
m Xn matrices over R, then 


I, A+ Bisanm xX n matrix over R; 
A+B=B+A4; 
(A+B)+C=A+4+(B+C); 
A+0= A; and 

A+ (—A)=0. 


wm KRWN 


Proof. The first part is contained in the definition. The other parts are all obtained by 
calculating the (i, j)-entry of each side. For instance, to prove (3), we note that the 
(i, j)-entry of (A+ B)+C is (aj; +b;;)+c;i;, whereas the (i, j)-entry of A+(B+C) 
is aj; + (bj; +-c;;), and these are equal. The rest of the proof is left to the reader. 


Anyone who has read Chapter 3 will note that Theorem B.1 implies that the m x n 
matrices over a ring form an abelian group under addition. 


Definition B.3. Let A be an m x n matrix over a ring R. If r € R, then the scalar 
multiple 7 A is the m x n matrix B such that bj; = raj; for alli and j. 


Example B.3. Working with 3 x 2 matrices over R, we have 


1 3 5 15 
5 | 126] = | 60 30 
12 5 10 


Here are a few properties of scalar multiplication. 


Theorem B.2. Let R be a ring andm,n € N. If A and B are m x n matrices over 
Randr,s € R, then 


1. rAisanm xX n matrix over R; 
r(A+ B)=rA+rB; 

. r+s)A=rA+t+sA; and 
r(sA) = (rs)A. 


AWN 


Proof. (1) is clear from the definition. Each of the other parts is proved by calculating 
the (i, j)-entry of both sides of the equation. For instance, the (7, j)-entry of r(A+ B) 
is r(a;j + b;;), whereas that of rA + rB is raj;; + rb;;, but these are the same, 
establishing (2). The rest of the proof is left to the reader. 


If F is a field, then Theorems B.1 and B.2, when combined with the obvious fact 
that 1A = A, show us that the m x n matrices over F form a vector space over F’, 
as discussed in Chapter 12. 

Matrix multiplication is a bit different. 
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Definition B.4. Let R be aring, and let A be ak x m matrix over R, and B anm xn 
matrix over R. Then the product AB is the k x n matrix C such that 


Cij = Gib); + aj2b2j + +++ + dimbmj, 


for alli and j. 


Example B.4. Let 
3415 
aa.) and B= ],2120 
1261 
111119 7 
aB=(4 14 20 4): 


If R is a ring with identity, then we also have the n x n identity matrix /,,, which 
is the n x n matrix A such that a;; = 1 for alli and a;; = 0 if i ¥ j. For instance, 


be matrices over R. Then 


100 
= |010 
001 


For any positive integer n, write M,,(R) for the set of n x n matrices over a ring 
R. 


Theorem B.3. Let R be a ring, n a positive integer, and A, B,C € M,,(R). Then 


AB &€ M,(R); 

(A+ B)C = AC + BC; 
A(B+C)=AB+AC; 

(AB)C = A(BC); and 

if R is a ring with identity, then I, A = AI, = A. 


MARWNS 


Proof. (1) follows from the definition. 
(2) The (i, j)-entry of (A + B)C is 


(aj, + bit )erj + (Giz + Di2)e2j + +++ + Gin + Din)enj, 
whereas the (7, j)-entry of AB + AC is 
(aj1C1j; + j2C2j + +++ + GinCnj) + (bie1j + Di2c2j + +++ + Dinenj), 
and these are equal. 
(3) is similar to (2). 


(4) Through repeated applications of (2) and (3), we can reduce (4) to the case 
where each of A, B and C has at most one nonzero entry. But then it is trivial. 
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(5) Let D = I,,. Then the (i, j)-entry of DA is 


dja); + djzdnj; ++++ + dinAnj = aj. 


Thus, J, A = A. The proof that AJ,, = A is similar. 


As discussed in Chapter 8, we have now proved that if R is a ring, then so is 
M,,(R), for any positive integer n. Furthermore, if R is a ring with identity, then so 
is M,,(R). It is, however, worth mentioning, that M/,,(R) need not be commutative, 
even if R is. For instance, in M>(R), 


GYOm nm colel 


Definition B.5. Let F bea field and n a positive integer. Then a matrix A € M,(F) 
is said to be invertible if there exists a B € M,(F) such that AB = BA = I,. In 
this case, we call B the inverse of A and write B = A~!. 


Example B.5. In M2(R), the matrix 


is invertible, as 


In most linear algebra courses, a couple of different methods of finding the inverse 
of a matrix are presented (often just in M/,, (IR), but the same methods work in M,,(F), 
for any field F’). There is, however, a shortcut for determining if a matrix is invertible. 


Definition B.6. Let F be a field and n a positive integer. If A € M,(F), then the 
determinant of A, det(A), is an element of F defined recursively as follows. If 
n = I, then det((a)1)) = ay. Ifn > 1, then for any 1 <i, j <n, let Ajj € My_-1(F) 
be the matrix obtained by discarding row i and column j of A. Then 


det(A) = ay; det(Aj1) — ai det(A12) + a3 det(A3) — ++» + (-1)"* ain det(Ain). 


Example B.6. In M2(F), we have 


a1 412 
det = 411022 — A)2a2). 
421 422 


Example B.7. In M3(R), let 
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Then 


det(A) = 2 det ((; ‘)) es ((3 *)) Tene ((s »)) 


= 2(—26) — 5(—41) + 3(—23) 
= 84. 


We conclude with the following result. 


Theorem B.4. Let F be a field and n a positive integer. If A, B € M,(F), then 


I. det(AB) = det(A) det(B); and 
2. A is invertible if and only if det(A) 4 0. 


Proof. We will prove the n = 2 case. The general case can be found in standard 


introductory linear algebra textbooks. 
(1) Observe that 


— fay by + ay2b21 ay b12 + ay2b22 
AB= : 
a21b11 + dz2b21 a21b12 + ax2br 


Thus, 
det(AB) = (ai1b11 + 412b21) (aai1b12 + a22b22) — (ai b12 + a12b22) (dai bi + a22b21). 
On the other hand 

det(A) det(B) = (a11422 — 412421) (bib — bi2ba1), 


and these are equal. 
(2) If det(A) # 0, then let 


B = (det(A))7! ( si _) 


—a21 4) 


It is easy to verify that AB = BA = /,; thus, B = A~!. Suppose, on the other hand, 
that det(A) = 0. If AB = hy, then by (1), det(A) det(B) = det(/2) = 1, which is 
impossible. 


Solutions 


Solutions to the odd-numbered problems. 


Problems of Chapter 1 


11 SOT = {3}, SUT = {1,2,3,4)}, S\T = {1,2}, T\S = (4) and Sx T = 
{C, 3), 1,4), (@, 3), 2,4, (3, 3), 3, DI}. 


1.3 Leta e¢ RUT.Thena € RoraecT.IfaeéR,thenas R C S,wehaveae S, 
andhenceae SUT. IfaeT,thenaée SUT. 


15 Takea €e RU(SOT). Thena€ Rorae SOT. IfaeR,thenae RUS 
anda € RUT,soa € (RUS)O(RUT). Ifa € SNOT, thena e€ S and 
a € T. Therefore,a € RUS anda € RUT,soa € (RUS) (RUT). Thus, 
RU(SNT) C (RUS) N(RUT). Conversely, suppose thata € (RUS)N (RUT). 
Ifa € R, thena € RU(SOT). Ifa ¢ R, then asa € RUS, we must have 
a € S and, similarly, a € T. Thus,a ¢€ SMT, and hence a € RU (S/T). That is, 
(RUS)N(RUT)CRU(CSNT). 


1.7 (2,3), (2, 4), (2,5), G, 8). 


1.9 Reflexive? Yes. Ifa € R, thena—a =0€ Q,soapa. Symmetric? Yes. If apb, 
thena—beQ,sob—a=-—(a—b) € Q, and hence bpa. Transitive? Yes. If aob 
and boc, then a — b,b —c € Q. But thena —c = (a—b)+ (b—c) €Q, soapc. 


1.11 (1) A relation is a subset of {1, 2, 3} x {1, 2, 3}. This Cartesian product has 9 
elements, and therefore 2? = 512 subsets. (See Exercise 1.4.) 

(2) A relation p is symmetric provided | 2 if and only if 2o1, 103 if and only 
if 3o1 and 2,3 if and only if 302. In short, we do not get to decide if 201, 31 or 
32, once all the other possibilities are decided. Thus, only 6 of the 9 possible pairs 
remain to be determined, so the total number is 2° = 64. 
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1.13 Reflexivity: As a — a = 3-0, we have a ~ a for all a € N. Symmetry: If 
a ~ b, then a — b = 3k, and hence b — a = 3(—k); thus, b ~ a. Transitivity: 
Suppose a ~ b and b ~ c. Then a — b = 3k, b —c = 31, for some k,/ € Z. Thus, 
a-—c=(a—b)+(b—-c) = 3(k +1); that is, a ~ c. It is an equivalence relation. 
As for the classes, [1] = {1, 4, 7, ...}, [2] = {2,5, 8, ...} and [3] = {3, 6,9, ...}. 


1.15 Reflexivity: As |a| = |a|, we have a ~ a for all a € Z. Symmetry: If a ~ b, 
then |a| = |b|. Therefore, |b| = |a| and hence b ~ a. Transitivity: If a ~ b and 
b ~ c then |a| = |b] = |cl, and hence a ~ c. It is an equivalence relation. The 
classes are [0] = {0}, [1] = {1, —1}, [2] = {2, —2}, and so on. 


1.17 Note that {1} is a subset of {1, 2}, but {1, 2} is not a subset of {1}. Therefore, 
~ is not symmetric, and hence not an equivalence relation. 


1.19 Reflexivity: If (a, b) € R*,then3a—b = 3a—b,s0 (a, b) ~ (a, b). Symmetry: 
If (a, b) ~ (c, d), then 3a—b = 3c—d,so3c—d = 3a—bandhence (c, d) ~ (a, b). 
Transitivity: If (a, b) ~ (c, d) and (c,d) ~ (e, f), then 3a —b = 3c—d = 3e — f, 
and hence (a, b) ~ (e, f). Also, (a, b) € [(4, 2)]ifand only if3a—b = 3.4—2 = 10; 
that is, if and only if b = 3a — 10. Thus, [(4, 2)] = {(a, 3a — 10): a € R}. 


1.21 Let a ~ b if and only if either both or neither of a and b lie in {1, 2, 3}. 
Reflexivity and symmetry are clear. Suppose a ~ b and b ~ c. Ifa é€ {1,2, 3}, 
then b € {1, 2,3} and hence c € {1, 2, 3}. Similarly ifa ¢ {1, 2, 3}. Thus, ~ is an 
equivalence relation. The classes are [1] = {1, 2, 3} and [4] = {4,5,6,...}. 


1.23 Ifa(a) = a(b), then 2a— 1 = 2b—1, and hence a = b. Thus, a is one-to-one. 
But there is no a € {1, 2, 3, 4} such that a(a) = 2, so a is not onto. 


1.25 If a(a) = a(b), then 277° = 232-9, Taking the base 2 logarithm, we have 
3a—5 = 3b—5, and hence a = b. Thus, a is one-to-one. Ifc € T, then a(((log, c)+ 
5)/3) = c. Therefore, a is onto as well. In fact, we will use 8 : T — S given by 
B(c) = (logy c) + 5)/3. We have B(a(a)) = B(2**~>) = (log, (2°) + 5)/3 =a, 
foralla eS. 


1.27 (1) and (3) are binary operations, as ab € N for alla, b € N, and3 € N. But 
(2) is not,asl*2=—1¢€N. 


1.29 Surely 6 is onto. If t € T, then there exists anr € R such that (6a)(r) = ft. 
But then 6(a@(r)) = t. However, a need not be. To see this, let R and S be the set of 
real numbers and let T be the set of nonnegative real numbers. Let a(r) = r? and 
B(s) = s*. Then @ is not onto, as there is no r € R such that a(r) = —1. However, 
if t € T, then B(a(x/t)) = B(./t) = t; thus, Ba is onto. 


1.31 (1) For each of the m elements a of S, there are n possible choices for w(a), 
son™, 

(2) Ifn < m, the answer is 0, as the m elements of S need to map to m different 
places. Suppose n > mand let S = {aj, ..., Gd}. Then there are n choices for a(a;), 
leaving n — | choices for w(a2), and so on. The answer is n(n — 1)---(n —m+ 1). 
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Problems of Chapter 2 


2.1 Apply induction. When n = 1, both sides are 1. Assume the result for n, then 
prove it forn+1:1+---+n+ (n+l) =n(n4+1)/24+(4+)) = (2+ 1I(14+2)/2 = 
(n+ 1)(m + 1) + 1)/2, as required. 


2.3 (1) This is the Binomial Theorem with a = b = 1. 
(2) This is the Binomial Theorem with a = 1,b = —1. 


2.5 (1) By induction on n. We have nothing to prove for n = 1, so we begin with 
n = 2. Here, (1 + a)? =14+2a+a?>1+2a,asa is positive. Assume the result 
for n, and prove it forn + 1. But 1 +a)"*! = (1 +a)"(1+a) > (14+na)(1+a) = 
1+(n+Da+na? >14+ (n+ la,asa > 0. 

(2) Apply (1) with a = (n — 1)/n, then take nth roots. 


2.7 By strong induction on n. If n = 1 or 2, the result is obvious, so assume that 
n > 2 and the result is true for smaller n. Then f, = fp-1 + fn-2 < (7/4)? + 
day = (7/4)"-3(7/4 +1) < (7/4)""1, since 11/4 < (7/4)?. 


2.9 By strong induction on the area a = rc of the bar. If the area is 1, thenr = c = 1 
and no actions are necessary. Suppose the area is a > 1| and the result is true for 
bars of smaller area. Then a break turns the bar into two bars with areas b and c, 
both less than a. By our inductive hypothesis, it will take b — 1 and c — | actions, 
respectively, to break down these two bars. We have already used 1 action, so the 
totalis 1+ (b—1)+(c—1) = (6+c)—1 =a-—1, as required. Alternative solution: 
We must turn | bar into rc bars. Each action adds one bar. So we need rc — | actions. 


O01 Gj 57 S 200) 417. 20: S 703447 S 3S eS 
2 = 1(2) + 0. Thus, (57, 20) = 1. 

(2) 117 = 51(2) + 15; 51 = 15(3) + 6; 15 = 6(2) + 3; 6 = 3(2) + O. Thus, 
(17,51) =3, 


2.13 Let us write b = ac anda = bd, with c,d € Z. Thena = acd; that is, 
a(1 — cd) = 0. If a = O, then as b = ac, we have b = 0 as well. Otherwise, 
1—cd =0,socd = 1. Thus, d € {1, —1}, soa € {b, —b}. 


2.15 Letd = (b,c). Then d|c and cla, so dja. But also d|b. As (a, b) = 1, we must 
have d = 1. 


2.17 If a and n are relatively prime, write au + nv = 1. Then n(—v) = au — 1. 
On the other hand, if (a,n) = d > | and au — 1 = nb, for some b € Z, then 
1 = au — nb. Now, d|a and d|n, so d|1, which is impossible. 


2.19 By strong induction on n. It is clear ifn < 4. So let n > 4 and suppose 
that it is true for smaller n. Then f, = fn—1 + fn—2 = (fn-2 + fn—-3) + fa—-2 = 
2 fn—2 + Sn-3 = 2fn 3 Sn 4) + Sn 7 = 3fn 34 2fn 4. If 4|n, then 4|(n — 4), 
so 3| f,-4, and hence 3] f,. Suppose that 4 { n. Then 4 { (n — 4), so 3 { fy—a. If 
3| fr, then 3|( fp — 3 fn—3) = 2 fn—4. As (3, 2) = 1, we see that 3] f,-4, giving us a 


contradiction. 
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2.21 3528 = 2? . 37. 77, 30030 = 2-3-5-7-11- 13 and 220000 = 2° - 5*- 11. 


2.23 Let d = (p,n). As d is a positive integer and d|p, we can only have d = 1 or 
p. If d = 1, we are done. If d = p, then as d|n, we are done. 


2.25 If pil(pi-+: pe + 1), then as pj|pi--- pe, we have pj|(pi--: pe + 1 — 
Pi-++* Px) = 1, which is impossible. 


2.27 By Corollary 2.4, pla. Let us say a = pb, with b € Z. Then a” = p"b", so 
p" ja”. 


2.29 (1) and (2) are clearly commutative, but (3) is not, since 1*2 = | but2*1 = 2. 


2.31 (1) No. There is no e € Z such that 2e + 1 = 2. 
(2) Yes, lete = 0. Thenaxe=a=exa forallaeé Z. 


2.33 (1) 4. 

Q) 445)? = (1)? ==1=6. 
2.35 (2) We have [a]+({b]+[c]) = [a]+[b+c¢] = [a+ (b+0)] = [(at+b) +c] = 
[a + b] + [c] = ([a] + [b)) + [el. 

(4) Note that [a] + [0] = [a + 0] = [a]. 

(5) Observe that [a] + [—a] = [a + (—a)] = [0]. 


2.37 2-10=4-5=6-10=8-15=12-5=14-10=16-5=18-10=0. If 
a € {1,3, 7,9, 11, 13, 17, 19}, there is no such b. 


2.39 Ifa? = lin Z,, then a? = 1 (mod p);thatis, p|(a*—1) = (a—1)(a+1). By 
Euclid’s lemma, p|(a— 1) or p|(a+ 1). Thatis,a = 1 (mod p) ora =—-1= p-1 
(mod p). If p = 8, then 1, 3, 5 and 7 are solutions. 


2.41 Proceeding as in the proof of Theorem 2.13, we have d; = 70, dy = 30 
and d3; = 21. Solving 3u; + 70v; = 1 using the Euclidean algorithm, we get 
uy = —23 and vy; = 1. Solving 7u2 + 30v2 = 1, we get uz = 13 and v2 = —3. 
Solving 10u3 + 21v3 = 1, we get uz = —2 and v3 = 1. Thus, our answer is 
a = 70-1-2+30(—3)(4) +21-1-3 = —157. (As our answer is only unique modulo 
3-7-10 = 210, we can also add 210 and get 53.) 


Problems of Chapter 3 


Olea! 
1234 
OS.) 


1234 
ors) 
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3.3. There are n places to map 1, then n — 1 to map 2, and so on. So there are n! 
permutations. If we fix 2, then the other four numbers can be arranged at will, so 
there are 4! = 24 possibilities. 


3.5 Closure: Yes, the composition of two functions is a function. Associativity: 
Yes, the composition of functions is associative. Identity: Yes, we have the identity 
function sending each element of {1, 2,3, 4,5} to itself. Inverses: No. Define a : 
{1,2,3,4,5} — {1, 2, 3,4, 5} via w(i) = | for all i. There is no possible function 
B: {1,2,3,4,5} > {1, 2, 3,4, 5} such that w6 is the identity function. 


3.7 (1) 


(2) 

(0, 0) (1, 0) (2,0) (0, 1) Gd, 1) 2, 2D 
(0, 0)|(0, 0) (1, 0) (2, 0) (0, 1) C, 1) (2, 1) 
(1, 0))(1, 0) (2, 0) (0, 0) G, 1) @, 1) (, 1) 
(2, 0)|(2, 0) (0, 0) C1, 0) @, 1) ©, 1) dG, 1) 
(0, 1)}(0, 1) G1, 1) @, 1) (0, 0) C1, 0) (@, 0) 
dd, 1)/d, 2) @, 1) (, 1) A, 0) (2, 0) (0, 0) 
(2, 1)/(2, 1) (0, 1) C1, 1) 2, 0) (0, 0) C1, 0) 


3.9 Take g; € G,h; € H. Now, (g1,1)(g2, 2) = (g2, h2)(g1, h1) if and only if 
(8182, hjh2) = (g2g81, h2hy); that is, if and only if g)g2 = g2g) and hyh2 = hgh. 


3.11 (1) Division is not associative; for instance, (1/2)/3 4 1/(2/3). 
(2) There is no inverse for 2 (or anything else other than 1). 


3.13 Yes. Let G be our set. If a+ bi,c + di € G, then (a + bi)(c + di) = 
(ac—bd)+(ad+bc)i. Now, (ac—bd)?+(ad+bce? =a? +b? +a? +h? = 
(a? + b*)(c? +d?) =1-1=1,80 (a+ bi)(c+di) €G. Complex multiplication 
is associative. Clearly | € G, and it will serve as the identity. If a + bi € G, then 
(a + bi)(a — bi) = a2 + &* = 1. Now, a* + (—b)? = 1, soa—bi € G and 
(a+bi)!=a-—bi. 


3.15 Yes. To show closure, note that (a/p") + (b/p™) = (ap”™ + bp")/p™*" € G. 
Addition of rational numbers is certainly associative, and 0 = 0/p is the additive 
identity. The additive inverse of a/p” is —a/p” € G. 


3.17 (1) aca7!b7'! cA}. 
(2) a~!e"!b-!a. 
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aldabc 
biabcd 
clbcda 
djcdab 


3.21 If (1) holds, then for any g,h € G, leta = g,b = hg,c = gh. Then 
ab = ca = ghg, so by assumption, hg = gh, and G is abelian. If (2) holds, then 
whenever ab = ca, we have ab = ac, so by cancellation, b = c. 


3.23 (1) |Zj2| = 12. Also, |O| = 1, |1] = |5| = |7| = |11] = 12, |2| = |10] = 6, 
|3| = |9| = 4, |4] = [8] = 3 and |6| = 2. 

(2) |Z2 x Z4| = 8. Also, |(0, 0)| = 1, |C1, 0)| = |, 2)| = |(O, 2)| = 2 and every 
other element has order 4. 


3.25 |a3| = 20/(3,20) = 20/1 = 20, Ja!2| = 20/(12,20) = 20/4 = 5 and 


|a!5| = 20/(15, 20) = 20/5 = 4. 


3.27 We are looking for the smallest positive integer n such that (a, b)” = (e, e); 
that is, such that a” = e and b” = e. But a” = e if and only if 12|n and b” = e if 
and only if 18|n. Thus, we want the smallest positive integer n divisible by both 12 
and 18. The order is 36. 


3.29 (1) Note that a” = e if and only if (a”)~'! = e7!; that is, if and only if 
(a“!)" =e. 
(2) Recall that conjugates have the same order. Also, ab = b~!(ba)b. 


3.31 First, note that U(8) has exactly three elements of order 2, namely 3, 5 and 7. 
Suppose that a and b are distinct elements of order 2. Now, (ab) =P=e=e, 
since G is abelian. Furthermore, if ab = e, then a = b~! = b, since b has order 2. 
But this is impossible. Thus, |ab| = 2. Furthermore, if ab = a, then b = e and if 
ab = b, then a = e. Thus, a, Db and ab are distinct elements of order 2. Let c be a 
fourth distinct element of order 2. By the same argument, ac has order 2. If ac = a, 
then c = e. If ac = b, thenc = a~'b = ab. If ac = ab, then c = b. None of these 


are true. Thus, ac is a fifth distinct element of order 2. 


3.33 (1) Yes. Clearly H contains the identity matrix. If A, B € H, then det(AB) = 
det(A) det(B) = 1-1 = 1, so AB € H. Furthermore, det(A7!) = 1/det(A) = 1, 
soA eH. 

(2) No. H does not contain the identity. 

(3) Yes. First, we see that 0 = 0/1 € H. Next, if a/b, c/d € H, then (a/b) + 
(c/d) = (ad + bc)/(bd) € H, and —(a/b) = (—a)/b € H. 


3.35 Let F be any flip and R any rotation. Drawing out the effects of each operation, 
we find that FR = R~'F. This is RF if and only if R = R7!. Letting R = R360/n, 
we find that R # R~!. Thus, no flip is central. In fact, R = R7! if and only if 
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R = Ro or R = Rigo. If n is odd, there is no Rig9, so Z(D2,) = {Ro}. If n is 
even, we see that Rig9 commutes with every flip, and surely with every rotation, so 
Z(Don) = {Ro, Rigo}- 


3.37 Let H and K be subgroups of G. Ifa,b € HN K,thena,b € H,soabe H. 
Similarly, ab € K, and therefore ab ¢ HM K. By the same argument, a'ed, 

“le K,soa! € HK. Finally, as H and K are subgroups, e € H ande € K, 
soe € HK. The argument for an arbitrary intersection is similar. 


3.39 (1) (0) = {0}, (1) = (3) = (7) = (9) = (11) = (13) = (17) = (19) = Za0, 
(2) = (6) = (14) = (18) = {0, 2, 4, 6, 8, 10, 12, 14, 16, 18}, (4) = (8) = (12) = 
(16) = {0, 4, 8, 12, 16}, (5) = (15) = {0, 5, 10, 15}, (10) = {0, 10}. 

(2) (1) = {1}, (3) = (11) = {1, 3, 9, 11}, (5) = (13) = {1, 5, 9, 13}, (7) = {1, 7}, 


(9) = {1, 9}, (15) = {1, 15}. 


3.41 Label the vertices of the regular n-gon from | to n, counterclockwise. Then 
notice that a rotation leaves the vertices in counterclockwise order, whereas a flip 
changes them to clockwise. This makes clear what must happen in each case. 


3.43 (1) We have (a!) = G, (a?) = de a’, a*,a®, a®, a'}, (a3) = {e, a>, a®, a*}, 
(a*) = {e, a4, a’}, (a°) = {e, a®}, (a?) = (e}. 

(2) As Zj2 is cyclic of order 12 with generator 1, we have (1) = Zj2, (2) = 
{0, 2, 4, 6, 8, 10}, (3) = {0, 3, 6, 9}, (4) = {0, 4, 8}, (6) = {0, 6}, (0) = {0}. 


3.45 A positive integer is not relatively prime to p” if and only if it is divisible by 
p. Thus, we are excluding p, 2p, 3p,..., p". There are p"~! such numbers. 


3.47 It does follow, as H 1 K is a subgroup of H, and every subgroup of a cyclic 
group is cyclic. 


3.49 Let G = (a) be a cyclic group of order n. Now, |a'| = n/(n, i), for every 
integer i. In particular, each element of G has order dividing n. Now, for every k 
dividing n, the number of elements of order k is y(k). Thus, the sum of the g(k) is 
the number of elements in G, namely, n 


3.51 (1) Ifa € HM K has order n, then (a) is a subgroup of order n in both H and 
K. As |H| = |K|=2, this means that H = K = (a), which is impossible. 

(2) If G has no elements of order n, we are done. Otherwise, take a € G of order 
n. Then (a) has g(n) elements of order n. If those are all of the elements in G, we are 
done. Otherwise, find b ¢ (a) of order n. Then (b) contains y(n) elements of order 
n, and by (1), (a) and (b) have no elements of order n in common. Thus, we now 
have 2g(n) elements of order n. Repeat. If the process stops, we have a multiple of 
y(n). If not, we have infinitely many. 


3.53 (1) Leftcosets:0+-H = {...,—-4,0,4,8,...}, 14H ={...,—-3,1,5,9,...}, 
2+ H ={...,—2,2,6,10,...},3+H4 ={...,—-1,3,7, 1l,...}. As G is abelian, 
the right cosets are the same. 

(2) Left cosets: RoH = {Ro, Fr}, Roo = {Roo, Fa}, RisoH = {Rigo, Fi}, 
RowH = { R270, F3}. Right cosets: HRo = {Ro, F5}, H Roo => { Roo, F3}, A Rigo => 
{Riso, Fi}, A Ro70 = {Ro70, Fa}. 
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3.55 Let G = pq. If H < G, then |H| divides |G|, so |H| € {1, p,q, pq}. As H 
is a proper subgroup, the order is not pq. But the trivial group is cyclic, as is any 
group of prime order. 


3.57 As HK isa subgroup of H and K, its order divides both 28 and 65. But 
(28, 65) = 1, so we can only have HM K = {e}. 


3.59 By Exercise 3.30, a; --- a, has order 1 or 2. But a group of odd order has no 
elements of order 2. 


3.61 Suppose otherwise, and leth;(HNK), ...,4n+1:CH MK) be distinct left cosets 
of HO K in H. Thenifi ¢ j, we have hy ‘hj ¢ HK. Since hy hj € H, we have 
hy 'hj ¢ K.Thatis,h,K,...,h,4,K are distinct left cosets of K in G, contradicting 
the assumption that [G : K] =n. 


Problems of Chapter 4 


4.1 (1) Yes. Clearly H contains the identity matrix. If A, B € H, then det(AB) = 
det(A) det(B) € Q, and det(A~!) = 1/det(A) € Q; thus, AB, A! € H, and H is 
a subgroup. Also, if C € GL2(R), then det(C~'AC) = det(C~!) det(A) det(C) = 
det(A) det(C~!) det(C) = det(AC~!C) = det(A) € Q; thus, C~'AC € H,so H 
is normal. 


=i 
11 10\ /11 1-1 we 2 : 
(2) No, é i} i 5) ( i} = (( > ), which is not diagonal. 


4.3 Let N = {e,a}. If b € G, then b“'ab € N. But if b~'ab = e, thena = 
beb“! = e; impossible. Thus, b-'ab = a, and a is central; naturally, e is always 
central. 


4.5 Let N and K be normal subgroups of G. By Exercise 3.37, NOK is asubgroup. 
Leta € NOK andg € G.Sincea € N, we have g~!ag € N. Similarly, g~'ag € K, 
so g-!ag € NK. The proof of the generalization is similar. 


4.7 Ifa € G, then a~'Ha is a subgroup of order n, soa! Ha = H. 
4.9 Ifg ¢G,ne€N, thenn—'g-'ng € N; thus, g~'ng = n(n“!g7'!ng) € N. 


4.11 We know that |a| is divisible by 5. Also, (aN)? = eN, so a> € N. By 
Lagrange’s theorem, (a>)'* = e. Thus, the order of a divides 70. So |a| € 
{5, 10, 35, 70}. To see that these are all possible, let G = Z79, N = (5) and let 
a be 14, 7, 2 and 1, respectively. 


4.13 For both parts, G = Dg x Z will suffice. We have Z(G) = (Rigo) x Z. 
As G/Z(G) has order 4, it clearly satisfies (2). As for (1), it remains to show that 
Dg/(Rjgo) is abelian. But this can be seen by examining the group table from Exer- 
cise 4.12. 
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4.15 Let |aN| = 42. As G is finite, we know that a has finite order, and so its order 
is a multiple of 42, say 42n. But then a” has order 42. It need not hold for infinite 
groups. Indeed, let G = Z, N = (42) and a = 1. We see that |1 + N| = 42, but 
every nonidentity element of G has infinite order. 


4.17 By Exercise 4.16, a~'b~'ab € K, for alla, b € G. Similarly, a~'!b~'ab € N. 
But K NN = {e}, so a~!b-!ab = e, and hence ab = ba. 


4.19 Clearly e € N. If a,b € N, say a’ = b! = e, for some k,/ € N, then 
(ab) = (a*)'(b!')* = eek = e; thus, ab € N. Also, |a| = |a~!|, soa! € N. Thus, 
N is a subgroup. As G is abelian, it is normal. Take any c € G. If, for some n € N, 
we have (cN)" = eN, then c” € N;; that is, c” has finite order, so c”” = e for some 
m € N. In other words, c € N,socN =eN. 


4.21 (1) It is a homomorphism. Indeed, w(ab) = log)y(ab) = logj,a + logy) b = 
a(a) + a(b). It is one-to-one, as if a(a) = 0, then log;ya = 0, soa = 1; that is, 
ker(a) = {1}. It is also onto, as if b € R, then @(10") = b. 

(2) It is not a homomorphism, as 6(0 + 0) = 1 4 2 = 6(0) + BO). 


4.23 We have a((a, b)(c,d)) = a((ac,bd)) = ac(bd)~'! = ab~'cd7' (since 
U (16) is abelian), and this is a((a, b))a((c, d)). Thus, a is ahomomorphism. Also, 
(7) = {1, 7}. Now, a((a, b)) = 1 if and only if ab—! = 1; that is, we have the pairs 
(1, 1), (3, 3),..., 5, 15). Similarly, a((a, b)) = 7 if and only if a = 7b, so we 
have the pairs (7, 1), (5, 3), (3,5), G, 7), (5, 9), (3, 11), C11, 13), , 15). 


4.25 (1) Not necessarily. For instance, H could be the trivial group. 
(2) Yes. Leth € H have order n. As @ is onto, say a(g) = h. Since G is finite, 
|g| < ov, so |h| divides |g|. Let us say that |g| = mn. Then |g’”"| =n. 


4.27 Note that gh = a((g,h)) = a((e, h)(g, e)) =a((e, h))a((g, e)) = hg. 


4.29 Let H = G/N. Definea : G > H viaa(g) = gN. It is a homomorphism, 
as a(g122) = gig2N = giNgoN = a(gi)a(g2), and g € ker(q@) if and only if 
gN =eN; that is, if and only if g € N. 


4.31 (1) Count the elements of order 2. 

(2) One is abelian and the other is not. 

(3) We know that Z is cyclic. Suppose that Z x Z = ((a, b)). Then there exists an 
n € Zsuch that (1, 0) = n(a, b). Since n cannot be 0, we see that b = 0. Similarly, 
a = 0. But this is impossible. 


4.33 Define a : Z > GL,(R) via a(a) = (; '): Note that a(a + b) = 


1 O 10)\ /10 : ; 
(, 15 = (; '} 6 ‘f= a(a)a(b). Thus, a is a homomorphism. In par- 


ticular, a(Z) = G is a subgroup of GL>2(R). Furthermore, if @(a) is the identity 
matrix, then a = 0; thus, @ is one-to-one. Therefore, Z is isomorphic to a(Z) = G. 
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4.35 Define a: H > a~!Ha viaa(h) = a ‘ha. Then for any h,k € H, we have 
a(hk) = a'hka = a'haa~'ka = a(h)a(k); thus, o is a homomorphism. By 
definition, it is onto. Also, if h € ker(a), then a~'ha = e; therefore, h = aa~' = e, 


and a is one-to-one. 


4.37 Ifn > 1 is a positive integer, then nZ is a proper subgroup which is infinite 
cyclic, and therefore isomorphic to Z. 


4.39 Define a : G > G via a((aj, @,...)) = (0, a), &,...). It is a homomor- 
phism; indeed, if a = (a, dz,...) and b = (bj, bo,...), then a(a + b) = a(a) + 
a(b) = (0, a; +b}, ag +b2,...). Furthermore, it is one-to-one; if a(a) = (0,0,...), 
then clearly a = (0,0,...). Thus, G is isomorphic to a(G), which is a proper 
subgroup of G. 


4.41 Define a: G > Zviaa((a, b)) = a — b. Now, a is ahomomorphism, since 
a((a, b) + (c,d)) =a((a+c,b+d)) = (a+c)—-(b+d) = (a—b)+(c-d)= 
a((a, b)) + a((c, d)). Also, a is onto, since for any a € Z, a((a, 0)) = a. Finally, 
the kernel is the set of all (a, b) such that a — b = 0; that is, ker(a) = N. Apply the 
First Isomorphism Theorem. 


4.43 Definea :R — A viaa(r) = cos(2zr) + sin(21r)i (where we are working 
in radians). As cos?(@) + sin?(@) = 1 for any @ € R, we see that a(R) C H. 
Furthermore, for any a,b € R such that a2 + b* = 1, we can surely findr € R 
such that cos(27r) = a and sin(27r) = b; thus, a(R) = H. To show that @ is a 
homomorphism, calculate a(r + s) and a(r)a(s) and use trigonometric identities. 
Finally, the kernel is the set of all r € IR such that cos(27r) = 1 and sin(27s) = 0; 
that is, ker(@#) = Z. Apply the First Isomorphism Theorem. 


lab 110 
4.45 (1) If | 0 1c ] commutes with | 0 1 0], then c = 0. Similarly, a = 0. But 
001 001 


matrices with a = c = 0 are easily seen to commute with everything in G, so those 
matrices form the centre. 


lab 
(2) Definea: G> Zx Zviaa Olc = (a,c). We can see that 
001 
lab lde la+de+af+b 
a Olc Olf =a 0 1 c+f 
001 001 0 O 1 
lab lde 
=(a+d,c+f)=a Olc +a Olf F 
001 001 


so a is ahomomorphism. It is clearly onto. Furthermore, its kernel is precisely Z(G), 
as we found in the first part. Now apply the First Isomorphism Theorem. 
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4.47 As a(ab) = (ab)” = a"b™ = a(a)a(b) (since G is abelian), we know that 
a is a homomorphism. If a(a) = e, then a” = e, and hence |a| divides m. But 
by Lagrange’s theorem, |a| divides n as well. Since (m,n) = 1, we can only have 
|a| = 1. Thus, a is one-to-one. As G is finite, it must be onto as well. 


4.49 Let a be an automorphism of Z x Zy. Now, any homomorphism sends the 
identity to the identity. As w is one-to-one, a((1,0)) € {(1, 0), (0, 1), 1, 1}. Fur- 
thermore, once a((1, 0)) is chosen, that leaves only two possibilities for a((0, 1)). 
Once both of these are decided, there is only one option left for a((1, 1)). So there 
are only 3 - 2 = 6 possible automorphisms. This does not mean that all of them are 
necessarily automorphisms, but as it happens, they are. To see this, note that every 
group of prime order is cyclic, and groups of order 4 are abelian. Thus, since Exam- 
ple 4.26 shows us that there are noncommuting automorphisms, no order less than 6 
is possible. So all of the functions we have considered are actually automorphisms. 
Also, every group of order 6 is isomorphic to Z or Dg. As the automorphism group 
is nonabelian, it must be De. 


4.51 As a(e) = e, we see that e is in our set. If a(a) = a and a(b) = BD, then 


a(ab) = a(a)a(b) = ab, so ab is in our set. Also, a(a~!) = (a(a))~! = a7!" so 


a~' is in our set. 


4.53 As a((a) x {e}) © (a) x {e}, let us say that a((a, e)) = (a’, e). Similarly, 
a((e, b)) = (e, b/) and a((a, b)) = (a, b)*. But then (a’, b/) = (a*, b*). That is, 
a((a,e)) = (a‘, e) and a((e, b)) = (e, b*). Then for any r,s € Z,a((a", b*)) = 
a((a, e))'a((e, b))* = (a*, e)"(e, DY’ = Ca", b’). 


4.55 If m is an integer, then we know that a(m) = a(m- 1) = m-a(1). Ifnisa 
nonzero integer, then a(m) = a (n- “) = na (“). Thus, a (“) = 4a(m) = “a(1). 


n n 


Problems of Chapter 5 


5.1 Let H = (3) and K = (31). We see that |H| = 8, |K| = 2. As the group is 
abelian, both subgroups are normal, and HM K = {1}. Thus |HK| = 8-2/1 = 
16 = |U(@2)|, so U(32) = Hx K. 


5.3 Note that 5(a, b) = (0, 0) if and only if 5a = 0 and 5b = 0. But 5a = 0 for all 
a, whereas 5b = 0 if and only if b € {0,5, 10, 15, 20}. Thus, 5 - 5 = 25 elements 
satisfy 5(a, b) = 0. Now, these elements have order dividing 5, so we need only 
exclude the identity, which has order 1; thus, there are 24 elements of order 5. As 
25(a, b) = (0, 0) for all a and b, we see that every element has order 1, 5 or 25. We 
have found 25 elements not having order 25, which means that 5 - 25 — 25 = 100 
elements have order 25. 


5.5 If Dg = H x K, then |H||K| = 8. As the subgroups are both proper, || = 4 
and |K| = 2 (or vice versa). By Corollaries 4.2 and 4.3, H and K are abelian, so Dg 
is abelian, giving us a contradiction. 
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5.7 Let G = Z) x Z, Ni = (C,0)), No = ((0,1)) and N3 = ((1, 1)). As 
G is abelian, normality is not an issue. We can see that G = N,N, so surely 
G = N|N2N;3. Also, each N;jON; = {(0, 0)}. But we cannot have G = N, x No x N3, 
since the order is wrong. 


5.9 It does not follow. Let G = Z. x Z) and H = Zp». Definea : G > H via 
a((a, b)) = a+b. Wehavea((a, b)+(c, d)) = a+b+c+d = a((a, b))+a((c, d)), 
so a is a homomorphism. As a((0,0)) = 0 and a((1,0)) = 1, we see that a is 
onto. Now, G = ((1,0)) x ((0, 1)), but a((C1, 0))) = a(((O, 1))) = A; thus, the 
intersection of the images is not trivial, so we do not have a direct product in H. 


5.11 (1) Z3 x Zr. 

(2) Zs, Zn x Z3, Zo x Zo, Zo x Z3 x Z3, Z3 x Z3 x Z3 x Z3. 

(3) Zg x Zs x Zag, Zig X Zo X Z5 X Zgg, Z2 X Zn X Zn X Z95 X Lgg, Lg X Zs X Zs X Zgg, 
Za X Zn X Zs x Zs x Lag, Zo X Ly X Zn X Ls X Zs X Ly, Zg X Zo5 xX Z7 X Zr, 
Za X Zo X Los x Z7 X Ly, Zn X Zn X Zo X Dos X ZX Hy, Zp x Zs xX Zs x Z x Zr, 
Za X Zo X Zs X Zs X Zq X Z7, Zn X Dy X Dy xX Zs xX 5 x Ly X Z7. 


5.13 As |U(56)| = g(56) = 24, the possibilities are Zg x Z3, Z4 x Zy x Z3 and 
Zo X Zy x Zz x Z3. But running through the elements of U (56), we see that none 
have order larger than 6. As Zg x Zs and Z4 x Zp x Z3 both have elements of order 
12, it must be Z) x Zo x Zo x Zs. 


5.15 We see that G is isomorphic to a direct product of groups of the form Zp, for 
various n; € N. But if n; > 1, then such a group has elements of order p?. 


5.17 Solving 5u + 7v = | in Z, one possible solution is u = 3, v = —2. Then 
a = a"*7" = (a°)3(a’)~*. Now |a| = 7 and |a~!4| = |a?!| =5. 


5.19 We proceed by strong induction on |G|. There is nothing to do if |G| = 1, so 
we start the induction with |G| = 2. In this case, p = 2 and G has an element of 
order 2. Let |G| > 2 and assume the result for groups of smaller order. Ife £ b € G, 
then choose some prime gq dividing |b|. Let a = b!"!/4, Then |a| = g. If gq = p, 
we are done. Otherwise |G/(a)| = |G|/q, and this is still divisible by p. By our 
inductive hypothesis, G/(a) has an element c(a) of order p. Thus, c? € (a), so 
c?4 = e. Hence, |c’| = 1 or p. But if c? = e, then (c(a))% = e(a). As |c(a)| = p, 
this is impossible. 


5.21 (1) 8, 2, 3, 3, 25, 7, 7. 
(2) 2,.2,2, 3,9, 27. 
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5.25 It is obviously the case for n = 1. For larger n, we claim that it is true if and 
only if 7 is a product of distinct primes. Ifn = pj, --- px, where the p; are all distinct, 
then the only possible list of elementary divisors is p;,..., pg, So the groups are all 
isomorphic. On the other hand, if p?|n for some prime p, then we have the cyclic 
group of order n and Zp x Zn/p. Since (p,n/p) = p > 1, we see that this group is 
not cyclic. 
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5.27 The list of elementary divisors of G; x Gz is obtained by combining the lists 
of elementary divisors of G; and G2. Similarly for G; x G3. If these lists are the 
same, then deleting the elementary divisors of G, from each list, we see that Gz and 
G3 have the same elementary divisors, and hence are isomorphic. 


5.29 We know that G is isomorphic to Zon x +--+ X Zon. If 2(aj,...,a%) = 
(0,...,0), then 2a; = O for all i; that is, each a; has order 1 or 2. But a cyclic 
group of order 2” has only one element of order 2, so there are only two such a;, for 
each i. In total, we get 2* elements. But we must exclude the identity, so our number 
is 2* — 1. 


5.31 (1) Remember that g + Z =r + Zif and only if g — r € Z. This is basically 
the same as Example 1.19, using Q instead of R. 

(2) We have b(a/b+ Z) =a+Z=0+4Z. Thus, |a/b+ Z| < b. Butifc e N 
and c(a/b+ Z) = 04+ Z, then ca/b € Z; that is, blac. Since (a, b) = 1, this means 
that b|c. In particular, c > b, so the order is b. 


5.33 We havea(a+b) =n(a+b) =na+nb = a(a) +a(b), so @ is a homomor- 
phism. If a € G, then since G is divisible, there exists a b € G such that nb = a. 
Thus, a(b) = a, and a is onto. But it is not necessarily an isomorphism. Let G be 
the Priifer p-group and take n = p. Then we see that 1/p + Z € ker(q@). 


5.35 If N is a subgroup of G, take a+ N € G/N. Then for any n €N, there exists 
ab € G such that nb = a. Therefore, n(b + N) = a+ N, and G/N is divisible. 
However, Q is divisible but Z is not, as there is no b € Z such that 2b = 1. 


5.37 Let G/N = (aN). If gN € G/N, then gN = (aN)*, for some k € Z. That 
is, g = a‘n, for some n € N. In other words, G = (a)N. Ife 4b € (a) NN, 
then b = a! € N, for some 0 #/ € Z. If] < 0, then we may replace b with b~!, 
so let / > 0. Then (aN)! = eN, which means that aN has finite order in G/N. As 
G/N = (aN) and G/N is infinite cyclic, this is impossible. Therefore,G = (a) x N. 
It remains only to show that (a) is infinite cyclic. It is surely cyclic, and if |a| = k, 
then again, (aN) = eN gives us a contradiction. 


Problems of Chapter 6 


6.1 (1) (247 6)(3 5) 
(2) 1 2 5)(3 6 4)(7 8) 


6.3 (1) 142) 675). 
(2) Writing the permutation as a product of disjoint cycles, we get (1 2)(3 5 4), 
so the inverse is (1 2)(3 4 5). 


6.5 An element of order 3 must be a product of one or more disjoint 3-cycles. Let 
us count the 3-cycles (a b c). There are 9 choices for a, 8 for b and 7 for c. But 
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(a bc) = (bc a) = (c a b), so we must divide by 3, giving 9-8 - 7/3 = 168. 
For pairs of disjoint 3-cycles, we get9-8-7-6-5-4/(3-3.-2) = 3360, using the 
same argument and the fact that the order of the two cycles is irrelevant. Finally, to 
get three disjoint 3-cycles, we have 9!/(3 - 3 - 3 - 3!) = 2240, again noting that the 
three cycles can be permuted as we please. Our total is 5768. 


6.7 If t exists, then it has order k. Thus, if k is even, then t? has order k/2, and 
therefore it cannot be a k-cycle. So suppose that k is odd. Let t = o *+!/?, Then 
t? = o*+! = o, as has order k. Furthermore, as 2((k + 1)/2) + (—Dk = 1, we 
know that ((k + 1)/2, k) = 1. The preceding exercise tells us that t is a k-cycle. 


6.9 Let |o| = 105 = 3-5-7. We know that |o| is the least common multiple of the 
lengths of its cycles in the disjoint cycle decomposition. The product of a 3-cycle, a 
5-cycle and a 7-cycle would work, som = 15 is a possibility. Can there be a smaller 
value? There must surely be a cycle whose length is a multiple of 7 and a divisor of 
105. If it is smaller than 15, it can only be 7. Similarly for 3 and 5. Thus, m = 15 is 
the smallest possible value. 

Let |t| = 125. The only way to make this happen is for the disjoint cycle decom- 
position for t to include a 125-cycle. We see that n = 125. 


6.11 (1) even 
(2) odd 


6.13 Without loss of generality, the possible products are (1 2)(1 2) = (1), having 
order 1, (1 2)(1 3) = (1 3 2), having order 3 and (1 2)(3 4), having order 2. 


6.15 Itis certainly impossible if n is 2 or 3, as the groups are too small. But ifn > 4, 
then A,, has the subgroup {(1), (1 2) 4), 1 3)@ 4), 1 4@ 3)}. dt contains the 
identity, and closure is easily checked.) This subgroup is not cyclic. Indeed, if o is 
a permutation of order 4, then its disjoint cycle decomposition is a product of one 
or more 4-cycles and, possibly, some 2-cycles. But a 4-cycle by itself is odd, so we 
need n > 6 to get something like (1 2 3 4)(5 6) € Ap. 


6.17 The order of an element in S,, is the least common multiple of the lengths of 
its disjoint cycles. If this order is odd, then these cycles all have odd length. But a 
cycle of odd length is even. 


6.19 We see by inspection that n = | falls into the second category and n = 2 and 
3 fall into the third category. Let n > 4. By Exercise 6.17, all elements of odd order 
lie in A,. As A, contains half the elements of S,, we see that there are at least as 
many elements of even order as of odd order, and they can only be equal if every 
element of A, has odd order. However, (12)(34) € A, has order 2. So ifn > 4, we 
are in the first category. 


6.21 Sucha subgroup would have index 2, and therefore be normal, by Theorem 4.1. 
But As is simple. 
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6.23 It can. Note that Ag has an isomorphic copy of As as a proper subgroup. (Just 
use the exact same permutations as in As, assuming that each fixes the number 6.) 


6.25 Let N bea nontrivial proper normal subgroup of A4. By the preceding exercise, 
N contains no 3-cycles, so N is a subgroup of the group exhibited in Example 6.11. 
If it is not the same group, then it can only have order 2. But by Exercise 4.3, a 
normal subgroup of order 2 is central. However, the elements of order 2 in A4 are 
the products of two disjoint transpositions, and these are not central. For instance, 
(1 2)3 40 23) 4 23) 2)(3 4). 


6.27 In view of the preceding exercise, it suffices to show that each (1 i),2 <i <n, 
is the product of such transpositions. We proceed by induction on i, beginning with 
i = 2. There is nothing to do there, so assume the result for 7 and prove it for i + 1, 
when | <i <n. However, iG @+1)0 0G @+1)) = d @+ 1), completing the 
proof. 


Problems of Chapter 7 


7.1 Let A = ¢ 3 € GL»2(R). Then A commutes with (; i} if and only if 


a=a+c,a+b=b+dandc+d = d; that is, if and only ifc = 0 anda = d. 
Thus, the matrices in the centralizer have the form é A where a,b € R (and 
a # 0, so that the matrix is invertible). 


7.3 We always have C(H) C N(H).Let A = ( ) and suppose that B € N(#). 


Then B-'AB € H,so B~'AB = A", for some integer n. However, det(B~'!AB) = 
det(A) = —3, whereas det(A”) = (det(A))” = (—3)”. We conclude that n = 1, and 
hence B~'AB = A. 


7.5 As always, C(H) C N(H). Let H = {e, a}. If b € N(#), then b-'eb = e and 
we must have b~!ab € H.Ifb7!ab = e, thena = e, whichis impossible. Therefore, 
b-'ab = a,andb € C(B). 


7.7 Take b € C(a). As C(a) isasubgroup, b~! € C(a),sob~!a = ab™!. Inverting, 
we get a-'b = ba™!; thus, b € C(a7'). This means that C(a) € C(a7!) © 
Ci@"y) = C@). 


7.9 (1) As H has prime order, it is abelian, so H < C(#). In particular, |C(H)| is 
divisible by 11 and divides 77, so itis 11 or 77. If itis 11, we must have C(H) = H. 
Otherwise, H < Z(G). In the same way, we now have |Z(G)| = 11 or 77. Since G 
is not abelian, Z(G) = H. But this contradicts Corollary 4.1. 

(2) Suppose otherwise. Combining (1) with Theorem 7.3, and noting that H is 
normal, we have G/H isomorphic to a subgroup of Aut( 7). By Theorem 4.14, H is 
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isomorphic to Z;,, and Theorem 4.22 tells us that Aut() is isomorphic to U(11). 
But this is a group of order 10 and cannot have a subgroup of order 7. 


7.11 {Ro}, {Riso}, (Roo, Roz}, (Fi, Fo}, (F3, Fa}. 


7.13 It does not follow. Let G = S83, H = ((1 3)) and K = ((1 3 2)). Now 
consider the subgroups ((1 2)) and ((2 3)). As (1 3)"'(1 2)(1 3) = 22 3) = 
(1 3 2)-'(1 2)(1 3 2), it follows immediately that these subgroups are both H- and 
K-conjugate. However, HM K = {(1)}, so they are not (H M K)-conjugate. 


7.15 If each [G : C(a)] in the class equation is divisible by Yr, then since |G| is 
also divisible by p”, we must have p* dividing the order of |Z(G)|, which is not the 
case. Thus, since each [G : C(a)] divides p”, one of them must be p. It follows that 
|C(@)| =p". 


7.17 (1) No. Groups of order 25 are abelian, so all conjugacy classes would have 
just one element. 

(2) Yes, S3. 

(3) No, the identity is always in a conjugacy class by itself. 


7.19 Suppose that b~!ab = a~!. Then b-2ab? = b~!a~!b = (b~!ab)~! = a. That 
is, b? € C(a). If G has odd order, so does b. Thus, write |b| = 2m — 1, for some 
m &€N. Then (b*)” = b, so b € C(a). But then b-!ab = a, so a = a“!. That is, 


a? = e. Asa has odd order, a = e, giving us a contradiction. 


7.21 Sylow 2-subgroup: (25) x (7). Sylow 5-subgroup: ((4, 0)). Sylow 7-subgroup: 
((0, 2)). 


7.23 We have |G| = 2-3-7*. The number of Sylow 7-subgroups is 1 + 7k, for 
some nonnegative integer k, and divides 6. The only possible solution is k = 0. 


7.25 Let H be a Sylow p-subgroup of G. By definition, its order is p’, the largest 
power of p dividing |G|. Thus, n < m. By Exercise 7.16, H has a subgroup of order 
p" , 

7.27 By the Second Isomorphism Theorem, H N/N is isomorphic to H/(H 1 N). 
In particular, its order divides || and is therefore a power of p. Furthermore, H < 
HN <G,so|G|/|HN| is a divisor of |G|/| |. In particular, |G|/|H N| is relatively 
prime to p. However, [G/N : HN/N] = (|G\|/|N|)/CAHN|/|N|) = |G|/|HNI. 
Thus, HN/N is indeed a Sylow p-subgroup of G/N. 


7.29 The number of Sylow 7-subgroups is 1 + 7k and divides 12. Thus, it is 1, and 
the Sylow 7-subgroup is normal. 


7.31 The number of Sylow 17-subgroups is 1 + 17k and divides 256, so it is | or 
256. If it is 1, the 17-Sylow subgroup is normal. If it is 256, then we note that each 
Sylow 17-subgroup is cyclic and has 16 elements of order 17. Distinct groups of 
prime order intersect trivially, so we have 16 - 256 = 4096 elements of order 17. 
This leaves only 256 other elements. But this is the size of a Sylow 2-subgroup, so 
there can be only one, and it is normal. 
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7.33. The number of Sylow p-subgroups is | + kp and divides q. If it is not 1, it is 
q, So p|(q — 1), giving us a contradiction. Thus, the Sylow p-subgroup is normal. 
Similarly, the number of Sylow q-subgroups is 1 + /g and divides p. Thus, it is 1, 
p or p*. Suppose it is not 1. If it is p, then g|(p — 1), and since (p — 1)|(p? — 1), we 
have a contradiction. If itis p?, we again obtain a contradiction. Therefore, the Sylow 
q-subgroup is normal as well. Thus, G is the direct product of its Sylow subgroups. 
Now, groups of order a prime or the square of a prime are abelian, and we are done. 


7.35 The number of Sylow 3-subgroups is 1 + 3k and divides 19, so it is 1 or 19. 
If it is 1, then there are 2 elements of order 3. If it is 19, then there are 38, since 
subgroups of prime order intersect trivially. 


7.37 Let H be a Sylow 7-subgroup and K a Sylow 17-subgroup. The number of 
Sylow 7-subgroups is | + 7k and divides 85, so it is 1 or 85. If it is 1, then H is 
normal. By Theorem 4.5, HK is a subgroup, and its order is 7 - 17/1 = 119. So 
assume that there are 85 Sylow 7-subgroups. Then we have 6 - 85 = 510 elements 
of order 7. The number of Sylow 17-subgroups is 1 + 17/ and divides 35, so it is 
1 or 35. If it is 1, then K is normal, and as above, we are done. Otherwise, we get 
16 - 35 = 560 elements of order 17. But we now have too many elements. 


7.39 It is not abelian, so we can rule out the two abelian groups. It has an element of 
order 6, namely (Rj29, 1), so we can rule out Ay. But it has no elements of order 4, 
unlike the group H from Example 7.14, which has ((1 2), 1). Thus, it must be Dj2. 


7.41 It suffices to show that every cyclic subgroup is normal, for then if K < Qg 
anda € K,b € Qg, we have b-!ab € (a) < K. As 1 and —1 are central, we need 
not worry about them. The remaining cases just involve checking, for instance, that 


jTij =—jij =kj =-i =i! € (i). 


7.43 (1) We have ag ,Q¢.4(X) = Ag.p(cx+d) = a(cx+d)+b = acx+ad+b. Thus, 
Qa,bMc.d = Cacad+b € G, since p { a and p{c imply that p { ac; that is, ac is not 0 
in Z,. Thus, we have closure. Composition of functions is always associative. The 
identity is a 9. To find the inverse of a, ,, note that we wantac = | andad+b=0. 
Now, (a, p) = 1, so write au + pv = 1, for some u, v € Z. Thus, au = 1 in Z,. Let 
c = u. Similarly, letting d = —ub, we have ad + b = —aub+ b = 0 in Zp. To see 
that the group is not abelian, note that a9 and a;,; do not commute. 

(2) Let H = {ag» € G: a € {1,2,4}}. Closure checks out as above, since 
products of 1, 2 and 4 remain in {1, 2, 4} in Z7. Clearly a1,9 € H,so H < G. There 
are 3 choices for a and 7 for b, so |H| = 21. Also, H is not abelian for the same 
reason given in the first part. 


7.45 The number of Sylow p-subgroups is | + kp and divides g. As p > q, it is 
1. The number of Sylow qg-subgroups is | + /q and divides p, so it is 1 or p. But if 
itis p, then g|(p — 1), which is not allowed. Therefore, both Sylow subgroups are 
normal, and G is the direct product of Z, and Z,, and hence isomorphic to Zp,, as 


280 Solutions 


Problems of Chapter 8 


8.1 The addition table is found in Table 3.1. For multiplication, the table is as 


follows. 
01234 


00000 
01234 
02413 
03142 
04321 


RwWNrO 


8.3 Itis easy to see that R is closed under addition and contains {0}. Thus, since it is a 
finite set, it is an additive subgroup of Zs, which is an abelian group. Furthermore, R 
is closed under multiplication in Z,5, and we know that this multiplication operation 
is associative and satisfies the distributive laws. Therefore, R is a ring. It is certainly 
commutative, and we can see that 6 is the identity. 


8.5 Itis not aring, as it does not satisfy the distributive laws. Let a(x) = x’, B(x) = 
x and y(x) = 2x. Then (wo (B+ y7))(x) = 9x7, but (wo B)(x) + (ao y)(x) = 5x?. 


8.7 It is easy to see that the sum of two matrices in R also lies in R. Also, matrix 
addition is commutative and associative. The zero matrix is the additive identity, 
and negatives of matrices in R lie in R. Thus, R is an abelian group under addition. 
The product of two matrices in R is easily seen to be in R. Furthermore, matrix 
multiplication is associative and satisfies the distributive laws. Therefore, R is a ring. 


‘ : : . ye : ae i 11 
It contains the identity matrix, so it is a ring with identity. However, (; _) and 
01 i ae 
01 do not commute, so it is not a commutative ring. 


8.9 Not necessarily. Consider the additive group Z,, but define a multiplication 
operation via ab = 0 for all a and b. Clearly this operation is associative and the 
distributive laws are satisfied. Thus, we have a ring with p elements, but there is no 
identity. 


8.11 (1) a2 + ba—ab—b*. 
(2) a3 — a*b — aba — ba? + ab? + bab +. b’a — b’. 


8.13 We have b = b1 = bac = 1c=c. 
8.15 No, use R = Zo @ Zo. 


8.17 Note that (a+bi)—(c+di) = (a—c)+(b—d)i € Rand (a+bi)(c+di) = 
(ac—bd)+(ad+bc)i € R, foralla,b,c,d € Z. Also, 0 € R. Thus, R is a subring. 
In addition, R is a unital subring, as it contains | + 07, the identity of C. 
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Oa 0b 


0 0 00\ (00 00 : ‘ F : 
€ Bi! ») € R,and ic ’) é ’) = (( > € R. Thus, R is a subring. It is a ring 


serves as the identity. But the identity of M2(R) is not there, 


8.19 Certainly R contains the zero matrix. If a,b € R, then € ’) _ (( ’) a 


0 
01 
so it is not a unital subring. 


with identity, as 


8.21 We have (0,0) € T. If rj,ro € R, then (r},0) — (%,0) = (7) —m,0) € T 
and (r;, 0)(r2, 0) = (r172, 0) € T. 


8.23 We have 0 = 0a € S. If rj, ro € R, then rya — ra = (rj —r2)a € S and 
(r}a)(r2a) = (rjar2)a € S. 


8.25 Not necessarily. Let R = Q, S = Zanda = 2. Then 1/2 € T, but (1/2)? ¢ T. 


8.27 We have 1 € R.Ifa+bi,c+di € R, then (a+ bi) — (c+ di) = (a—c)+ 
(b — d)i € R. Furthermore, (a + bi)(c + di) = (ac — bd) + (ad + bc)i € R. If 
c+di £0, then (c + di)(c — di) = c? + d’, which is a nonzero rational number, 


: ccs d: 
so the inverse of c + di is ore — apple R. 


8.29 We have (r,s) € U(R @ S) if and only if there exist r; € R, 5s; € S such that 
rr; =ryr = Land ss; = s,s = 1; that is, if and only ifr € U(R) ands € U(S). 


8.31 If a? =a, then a(a — 1) = 0, s0 since there are no zero divisors, a = 0 or 1. 
An integral domain must have these two elements. 


8.33 By Exercise 8.20, KN L isasubring. As 1 € K andl € L, wehave 1 € KNOL. 
Also, if0 4a é KOL,thena7! € K anda~! € L,soa~! € KL. Thus, KNL 
is a subfield. The proof for an arbitrary collection of subfields is similar. 


8.35 Wehave (a!°)* = (b!°)4 and (a!3)? = (b"3)3. Thatis, a*° = b*° anda®® = b*®. 
So, aa = b*°b = a*’b. If a = 0, then since b*° = 0 and there are no zero divisors, 
b=0. Ifa 4 0, then cancelling a*?, we obtain a = b. 


8.37 (1) 7. 
(2) 0. 


8.39 As | cannot have infinite order in a finite additive group, we know that char 
R = p, for some prime p. Thus, pa = 0 for all a € R, so every element of R has 
additive order 1 or p. If |R| is divisible by some prime g # p, then by Cauchy’s 
theorem, R has an element of additive order g, which is impossible. Thus, the only 
prime dividing |R| is p. 


8.41 (1) We have (1+ a)\1—a+a?—a?+---+(-—D""!a""!) = I. ; 
(2) Let char R = p and choose k such that p* > n. Then (1 + a)? =1+a? 
(using the Freshman’s Dream). But a” = a"a”—" =0,s0 (d+ a)? =, 
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Problems of Chapter 9 


9.1 (1) (0, 0), (0, 1), (0, 2), (0, 3), (0, 4), (0, 5). 
(2) (0, 0), (2, 0), (0, 3), (2, 3). 


9.3 By Exercise 8.20, JM J is a subring. Takea € 1M J andr € R. Thena e/ 
implies ra, ar € I. Similarly, ra,ar € J, sora,ar € 10 J, and I J is an ideal. 
The argument for an arbitrary collection of ideals is similar. 


9.5 (1) Let R = 2Z,a=2. 
(2) Let R = M)(R),a = ¢ yi Every matrix in S is of the form (’ i) for 


some b,c € R. Clearly a € S, but multiplying on the right by (; i) we get a matrix 


not in S, so S is not an ideal. 
9.7 Let R = Zs, I = (2) and J = (4). 


9.9 We know from Exercise 3.42 that G is an additive group. It is clearly abelian. 
It is also closed under multiplication. Furthermore, (qj, d,...)((b1, b2,...) 
(C1, Co,...)) = (ab, C1, Agb2ce2, ...) = (ay, da, ...)(b1, bo, ...)) (Cy, C2, ...). Thus, 
we have associativity of multiplication. The distributive law follows similarly, and 
we have aring. By Exercise 3.42, H is an additive subgroup of G, so it remains only 
to check absorption. If (a), d2,...) € H, (b;, bo, ...) € G, then only finitely many 
of the a; are different from 0, so only finitely many of the a;b; are different from 0, 
and (a,b;, a2b2,...) € H. Similarly for (b, a1, b2az, ...). 


9.11 The addition table may be found in Table 4.2 (replacing each instance of “N” 
with “7”’). The multiplication table follows. 


O+714+1724+73+144+1 
0+7/0+704+704+704+/70+T 
14+7)0+714+724+73+/744+1 
24+7/04+724+744714/734+I 
3+7)04+73471+74+1724+1 
44+7/0+744+/734724+/71+4+T 


9.13 Expanding, we obtain 8x++2x?+7x?+5x+2+I. Now, 2(x3+6x?+2) € I,so 
2x3 +7 = —12x?—4+17. Also, 8x (x3 +6x72+2) E [,so8xt*+1 = —48x3—16x4+1. 
Similarly, 48x3 + I = —288x2 — 96 + J. Thus, our answer is 288x2 + 96 — 16x — 
12x? —44 7x? + 5x + 2+ 7 = 283x? —11x+944+ 7. 


9.15 By the preceding exercise, R/(JM J) is commutative if and only if ab — ba € 
IJ for alla, b € R. But this happens if and only if ab — ba € I andab—bae J 
for all a, b € R; that is, if and only if R/J and R/J are commutative. 


9.17 The only ideals of F are {0} and F, so 81 and 1. 
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9.19 If (a+ J)" =0+T7, then a” € J, and hence there exists an m € N such that 
(a")” = O; that is, a”” = 0, which means that a € J, andhencea+/=0+T. 


9.21 (1) No, as a@(1- 1) = 2 but a(le(1) = 4. 
(2) Yes. If f(x), g(x) € R[x], thena( f(x) + g(x)) = f(2)+8(2) =a(f(x))+ 
a(g(x)) anda(f(x)g(x)) = f(2)g(2) = a( f(x) )a(g(x)). 


9.23 For any r},r2 € R, we have B(a(r; +12)) = Bla(r}) + @(r2)) = B(@(1)) + 
B(a(r2)), and similarly for multiplication. 


9.25 It is a homomorphism, as a((a,b) + (c,d)) = a((a+c,b+d)) = (at 
c,0) = a((a, b)) +a((c, d)), and similarly for multiplication. The kernel is {0} © Z. 
Furthermore, w~!(2Z @ 3Z) = 2Z @ Z. 


9.27 Let S = R/I and definea : R > S viaa(a) =a+I]1.Wehavea(a+b) = 
a+b+7=(a+/1)+(6+4+/) =a(a) + a(b), and similarly for multiplication, so 
a is ahomomorphism. Also, a € ker(q@) if and only if a+ 7 = 0+ /; that is, if and 
only ifa eI. 


9.29 Leta: F — K bea homomorphism. Now, ker(q@) is an ideal of F. As F isa 
field, this means that ker(~) = {0} or F. In the former case, a is one-to-one, which 
is impossible, as K has fewer elements than F’. In the latter case, a(a) = 0 for all a, 
so this is the only possible homomorphism. 


9.31 (1) The additive groups are not isomorphic. (See Exercise 4.31.) 
(2) One has an identity, the other does not. 


9.33 Leta: R — S be an isomorphism. We claim that w(Z(R)) C Z(S). But if 
a € Z(R), then for any r € R, we have ar = ra, and hence a(a)a(r) = a(r)a(a). 
As a is onto, a(a) commutes with everything in S. Thus, restricting a to Z(R), we 
have a one-to-one homomorphism into Z(S). But if b € Z(S), then for any r € R, 
we have a(r)b = ba(r). Letting b = a(c), this means that a(r)a(c) = a(c)a(r); 
that is, a(rc) = a(cr), and since @ is one-to-one, rc = cr. In particular, b = a(c) € 
a(Z(R)). Therefore, a : Z(R) — Z(S) is onto as well, and hence an isomorphism. 


9.35 Let K be the field of fractions, and define a : F > K viaa(a) = [a, 1] for 
alla € F.Ifa,beé F,thena(a+b) =[a+b), 1] = [a, 1]+ [b, 1] = a(a)+ a(b) 
and a(ab) = [ab, 1] = [a, 1][b, 1] = a(a)a(b); thus, a is a homomorphism. If 
[a, 1] = [0, 1], thena = 0, soa is one-to-one. Furthermore, if a, b € F, withb 4 0, 
then a(ab~') = [ab~!, 1] = [a, b]; thus, a is onto. 


9.37 No: Z and Q are certainly not isomorphic (Q is a field but Z is not), however, 
we already know that the field of fractions of Z is isomorphic to Q, and by Exercise 
9.35, the field of fractions of Q is isomorphic to Q as well. 


9.39 (1) Note that 


(ca) * (Ga) = (oF att) =«((Ca)) +2((E3): 
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Furthermore, 


7 ab\/(ef\\ _ (aet+bg ce+dg a ef 7 ab 
cd)\gh)}) \af+bhcf+dh} — gh cd}}- 
Also, it is clear that applying @ twice returns the original matrix. 
(2) requires similar computations. 


9.41 Definea: ROS > Sviaa((r,s)) = s.Ifr; € R, 5; € S, then a((ry, 5;) + 
(r2, $2)) = a((r] +12, 51 + $2)) = 51 +52 = @((71, $1)) + (72, 52)), and similarly 
for multiplication. Thus, ~ is a homomorphism. If s € S, then a((0,5)) = s, and 
hence @ is onto. Finally, (7, s) € ker(@) if and only if s = 0; thatis, ker(~) = R@® {0}. 
Apply the First Isomorphism Theorem. 


9.43 Define a : Z[x] > Zs via a(f(x)) = [f(0)], where the square brackets 
denote the congruence class in Zs. If f(x), g(x) € Z[x], then a( f(x) + g(x)) = 
Lf) + 9(0)] = [FO] + [g)] = a( f(x) + @(g(x)), and similarly for multipli- 
cation. Thus, a is a homomorphism. If [a] € Zs, then letting f(x) be the constant 
polynomial a, we see that a(f(x)) = a; thus, a is onto. Furthermore, as f (0) is 
the constant term of f(x), we see that ker(q@) is precisely 7. Now apply the First 
Isomorphism Theorem. 


9.45 The first part is the Third Isomorphism Theorem. To see the second part, note 
that 3Z,/12Z = {0+ 12Z, 3+ 12Z, 6+ 12Z, 9+ 12Z} is a commutative ring having 
identity 9+ 12Z. Furthermore, its characteristic is 4. Thus, it has a subring isomorphic 
to Z,4. As the ring only has four elements, the ring is itself isomorphic to Z4. 


9.47 Note that (1+7)(1 —i) = 2 € (2), and yet neither 1 +i nor 1 —i is a multiple 
of 2 in R. The ideal is not prime and hence, as R is a commutative ring with identity, 
not maximal. 


9.49 Let J be a prime ideal. Then R/J is an integral domain. But a finite integral 
domain is a field (see Theorem 8.10), so R/J is a field, and hence J is maximal. 


9.51 Let R = 2Z, = {0, 2} and J = {0}. Now, J is surely maximal, since if it got 
any larger, it would be R. But it is not prime, as 2 ¢ J, but2.2 =O eT. 


9.53 Ina field, the only element that is not a unit is 0, and {0} is an ideal. In Z,», we 
know (see Exercise 8.30) that the units are precisely the elements a that are relatively 
prime to p”. In other words, the elements that are not units are those that are divisible 
by p, so (p) is the ideal in question. 


9.55 Use P= R@I.AsT # R, wesee that P 4 R@R. Also, if (a, b)(c, d) € P, 
then bd € I. As I is prime, either b or d is in J, and hence (a, b) or (c, d) is in P. 
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Problems of Chapter 10 


10.1 f(x) — g(x) = 9x4 + 8x? + 4x? + 6x +4, f(x)e(x) = 4x7 + Tx® + 2x9 + 
xt 47x? + 4x +5. 


10.3 q(x) = 5x? + 6x +3, r(x) = 4x7 +1. 
10.5 No. According to the preceding exercise, x is not a unit. 


10.7 Suppose that char R[x] = n > O. Then, in particular, for every constant 
polynomial a, we have na = 0. Thus, 0 < char R < n. On the other hand, if char 
R=~m > 0,then for any f(x) € R[x], we note that for each coefficient a; appearing 
in f(x), we have ma; = 0; thus, mf(x) = 0, and 0 < char R[x] < m. The only 
remaining case is where char R and char R[x] are both 0. 


10.9 Certainly 0 € S[x]. Take f(x), g(x) € S[x]. Then all coefficients of f(x) and 
g(x) lie in S. The coefficients of f(x) — g(x) are differences of elements of S, and 
hence lie in S, so f(x) — g(x) € S[x]. Similarly, the coefficients of f(x)g(x) are 
sums of products of elements of S and thus lie in S. Hence, S[x] is a subring. Let S be 
an ideal. If f(x) € S[x] and g(x) € R[x], then the coefficients of f(x)g(x) are sums 
of products, where each term in the sum is an element of S multiplied by an element 
of R, and therefore lies in S. Thus, f(x) g(x) € S[x]. Similarly, g(x) f(x) € S[x]. 


10.11 As a and ab are associates, write ab = au, where u is a unit. If a € 0, then 
cancellation gives b = u. 


10.13 Let a be a unit. Then for any 0 4 Db € R, we have b = a(a'b). Thus, 
é(a) < €(b), so €(a) is indeed the smallest possible value, n. Now suppose that a is 
not aunit. We can write | = aq+r,whereg,r € R andeitherr = Oore(r) < e(a). 
In the former case, a is a unit, which is a contradiction. In the latter case, ¢(a) is not 
the smallest possible value. 


10.15 We have 


3 ig By Db 
Fey = 409 (3) +( 5 > * e+) 
a 73 132 19 3\(_4, 46) (720, 144, 216 
x)= x x x x x x . 
. 2 2 2°°3 7" 49 49 49 49 


and since Px? + wx + 26 divides —ix3 ee 3, the former is a gcd. We 


must make it monic, so multiplying by 49/72, we get (f(x), g(x)) = x7 + 2x +3. 


10.17 Beginning with the second of the two equations in the solution to Exer- 
cise 10.15, we see that 
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a 7, #13, 19 3 4 46 
(F@). 8) = 5 (se ( ee >) ( - a) 
49 3 4 46 
= 5 (ee (409-9 (3)) (-3°- 8) 
eee ee 
= FO) \ Tg% + 36 J +8) | 35 — ag): 
10.19 We must apply the Euclidean algorithm. Let us use the notation established 
in Example 10.8, taking vu = 5+ 7i and v = 1 4+ 37. Now, (1 + 3i/)(1 — 3i) = 10, 


souv-! = (5+7i)(1— 3i)/10 = 2.6 — 0.87. Thus, we have m = 3 andn = —1, so 
gq =3-iandr = (54+ 7i) — 1 + 31)(3 —i) = —1 —1. That is, 


5471 = (1+3/)8-i1)4+ (C1-i). 


For the next step, we let uv = 1+ 3i and v = —1 —7. But (—1 —i)(—1+/) =2,s0 
uv! = (1+ 3i)(-1+ i)/2 = —2 —i. Therefore, 


(43 = (1 )-2-) +6, 


Thus, —1 —i isa gcd of 5+ 7i and | + 3i. 


10.21 Using the notation as in Example 10.15, we note that N(1 + 2\/5i) = 21. 
If 1+2/5i = uv, then N(u)N(v) = 21, and assuming without loss of generality 
that N(u) < N(v), we have N(u) = 1 or 3. As in Example 10.15, N(u) = 3 is 
impossible and N(u) = | means u € {1, —1}. In particular, uv is a unit and 1 + 2S 5i 
is irreducible. However, (1 + 2/5i)(1 — 2/5i) = 21 = 3-7. Thus, (1 + 2/5i)|21. 
But as N(3) = 9 and N(7) = 49, we cannot possibly have | + 2/5i dividing 3 or 
7. Thus, 1 + 2,/5i is not prime. 


10.23, Combine the preceding two exercises with Theorem 10.11. 


10.25 Not necessarily. We know that Z is a Euclidean domain, but Z[x] is not a 
PID, hence not a Euclidean domain. 


10.27 This is essentially the same as Exercise 2.24. 


10.29 Suppose not, and let 0 4 a € R be a nonunit. Let J, = (a"). As a"|a"*!, we 
see that I,,, C J,. Suppose that J, = J,,,. Then a” € (a"*'); that is, a” = a"*'b, 
for some b € R. Cancelling a", we get 1 = ab. Thus, a is a unit, giving us a 
contradiction. 


10.31 Using the notation in Example 10.8, we have e(1 +i) = 2. As we noted in 
that example, if u and v are in our ring, and uv = 1 +i, then e(u)e(v) = 2. But 
€(u) and e(v) are nonnegative integers, so without loss of generality, e(u) = 1. This 
means u € {+1, +i}, and so u is a unit. Thus, | + 7 is irreducible. However, we 
know that R is a Euclidean domain, and hence a PID, so every irreducible is prime, 
by Theorem 10.11. 
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10.33 Not necessarily. We know that Q[x] is a UFD, and yet its subring R discussed 
in Example 10.18 is not. 


10.35 Leta = 2,b =5,c =2+4+ J6i andd = 2 — Ji. Clearly ab = cd = 10. 
Defining a norm as in Example 10.15 via N(@m + nv/6i) = m* + 6n”, we see from 
the same calculation that N(uv) = N(u)N (vy) for allu, v € R. Suppose that uv = 2. 
Then N(u)N(v) = 4, soeither N(u) = N(v) = 2 (whichis impossible) or N(u) = 1 
and N(u) = 4 (or vice versa). But this means u is | or —1. In particular, u is a unit, 
so 2 is irreducible. Similar calculations show that b, c and d are irreducible. It is 
immediate that neither a nor b divides c or d. That R is not a UFD follows from the 
definition. 


10.37 Let p be an irreducible of R. Then p is a nonzero nonunit. Suppose that p|ab, 
for some a, b € R. If a is a unit, then b and ab are associates, so p|b. If a = 0, then 
p\a. Similarly if b is zero or a unit. So let a and b be nonzero nonunits. We may 
write a = p,--- py and b = q)---q), where the p; and q; are irreducible. By the 
preceding exercise, p divides some p; or some qj. Without loss of generality, say 
P\qi. Since qi|b, we have p|b. 


Problems of Chapter 11 


11.1 (1) As 5 is a root and the degree is greater than 1, no. 

(2) Trying each possible root in Z7, we see that this polynomial has no root there. 
Thus, since the degree is 3, it is irreducible. 

(3) No, since it factors as (x? + 4)(x? + 4). 


11.3 The possibilities are x? + ax? + bx +c, where a,b,c € {0, 1}. Ifc = 0, then 
0 is aroot, soc = 1. Also, 1 is aroot of x? + 1 and x? + x2 + x + 1, so we can rule 
them out. The remaining polynomials are x* + x? + 1 and x3 + x + 1. Both have 
degree 3, and neither has a root, so they are irreducible. 


11.5 Leth(x) = f(x)— g(x). If f(x) 4 g(x), then A(x) is not the zero polynomial. 
Say deg(h(x)) = n. Then h(x) can have at most n roots, buth(a) = f(a)—g(a) = 0 
for all a € F, giving us a contradiction. 


11.7 No, take a,b € R such that ab ~ ba. Letr = a, f(x) = x and g(x) = D. 
Then a(f (x)g(x)) = a(bx) = ba, whereas a(f (x))a(g(x)) = ab. 


11.9 As deg(x? + 1) = 2, the polynomial is reducible if and only if it has a root 


m € {0,1,..., p — 1}. Factoring out x — m, we can only be left with x — n, for 
somen € {0,1,...,p— 1}. Thus, x? + 1 = x? —(m+n)x +mn. That is, x? + 1 is 
reducible if and only if there exist m,n € {0,1,..., p— 1} such that p|(m +n) and 


p|(mn — 1). Given the range of values for m and n, we can only have m+n € {0, p}. 
But m and n cannot possibly both be 0, so m+n = p. 
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11.11 (1) The only possible rational roots are +1, +2. But none of these work, so 
it has no rational roots. 

(2) The possible rational roots are of the form m/n, where m|(—2) and n|6. Trying 
all of the possibilities, we see that —1/2 and 2/3 are roots. 


11.13 (1) Looking first for rational roots, we know that they must be integers and 
divide 18. We find that 3 is a root, so we have (x — 3)(x? — 7x” + 14x — 6). Now, 3 is 
also a root of x? — 7x? + 14x — 6, so we have (x — 3)?(x? —4x +2). By Eisenstein’s 
criterion, we are now done. 

(2) Looking first for rational roots, we see that they can only be +1, +2. In fact, 
—2 is a root, so we have (x + 2)(x? + x + 1). Now, the only possible rational roots 
of x? + x + 1 are 1 and —1, and these do not work. A degree 3 polynomial with no 
roots is irreducible, so we are done. 


11.15 Note that f(x) is a constant polynomial if and only if f(x + a) is a constant 
polynomial. Thus, we may assume that both have degrees larger than 1. Suppose 
that f(x) = g(x)h(x), where g(x) and h(x) are nonconstant polynomials. Then 
f(xt+a) = g(xt+a)h(x+a). As g(x+a) and h(x+a) are nonconstant polynomials, 
it follows that f(x + a) is reducible. The converse is similar. 


11.17 It is irreducible, using the preceding exercise with p = 7. 


11.19 (1) We know that 2 — 3i must also be a root of f(x), so f(x) is divisible 
by (* — (24+ 31))(« — 2 — 31) = x? — 4x + 13. Performing the division, we get 
f@) = (x? — 4x + 13)(x — 7), so the third root is 7. 

(2) Here, 1 +7 is also a root, so f (x) is divisible by (x — (1 —1))(x —-(1+i)) = 
x? —2x +2, Performing the division, we get f(x) = (x? — 2x + 2)(x? + 2x + 3). 
By the quadratic equation, the remaining roots are —1 + /2i and —1 — /2i. 


11.21 (1) By Eisenstein’s criterion, the polynomial is irreducible over Q. In R[x], 
we can factor it as (x — 10) (x + 10) (x? + al 10): In C[x], we factor further and 
get (x — Y10)(x + 10)(x — 101) (x + 103). 

(2) Using the Rational Roots Theorem, we find that 2 is a root. Thus, we can 
factor it as (x — 2)(x? + 3x + 11). But x? + 3x + 11 is irreducible over R, hence 
over Q, so we are done in those two cases. For C, we use the quadratic equation and 


get (x — 2)(x — (—3 + V35i)/2)(x — (—3 — V35i)/2). 


11.23 The roots must also include 2 + 5i and 4 — i, so we can use 


(x — (2— 5i))(x — (24+ 51) — 44+ 1))(« — (4-1) - 6) 
=x? — 18x* + 150x? — 768x + 2293x — 2958. 


11.25 (1) Reducing modulo 5, we get x* + 2x + 1. We see that it has no roots in 
Zs, and since it has degree 3, the polynomial is irreducible in Zs[x], and hence f (x) 
is irreducible in Q[x]. 

(2) Reducing modulo 3, we get x4 + x? + 2. This has no roots in Z3, but we must 
rule out the possibility of a product of two polynomials of degree 2. We may assume 
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that both such polynomials are monic, and we can only have something of the form 
(Ge er4 1G? + bx $3) = 2? ee +2; Comparing coefficients, we find that 
a+b =Oand2a+b = 0. Thus, a = b = 0. But (x? +1)@?4+2) =x*4+24 
x4+ 4x? +2. Thus, our polynomial is irreducible in Z3[x], and f (x) is irreducible in 


Qi]. 


11.27 The monic polynomials of degree 2 are precisely those of the form x?+-ax+b, 
witha, b € F. There are thus n? of them. Such a polynomial is reducible if and only if 
it factors as (x —c)(x —d), withc, d € F. Whenc = d, there aren choices. Ifc 4 d, 
there are n choices for c and n — | for d. Of course, (x —c)(x —d) = (x —d)(x —c), 
so we get n(n — 1)/2 possibilities, for a total of n + n(n — 1)/2 = n(n + 1)/2 
reducible polynomials. By unique factorization, all of them are distinct. Thus, the 
number of irreducibles is n? — n(n + 1)/2 =n(n — 1)/2. 


11.29 (1) x*++1= (x? +a)(x? —a). 
(2) x4 + 1 = (x? +. ax — 1)(x? — ax — 1). 
(3) x4 + 1 = (x? +ax + 1)(x? — ax +1). 


Problems of Chapter 12 


12.1 No, as x” + 1 and x” lie in V but their difference, 1, does not. 


12.3 Wehave0 € U and0 € W,so0 € UNW. If v1, v2 e UNW, then vy), v». € U, 
so vy +v2 € U. Similarly, vy} + v2 € W,sovj+v.eUNW. Ifa e F,thenav,; «e U 
andav, € W,soavy, € UNW. The argument for an arbitrary collection of subspaces 
is similar. 


12.5 As0 € U,wehave0 = a(0) € a(U). (This follows immediately from the fact 
that a is, by definition, ahomomorphism of additive groups.) Also, if a(u,), a(u2) € 
a(U) anda € F, then a(u,) + a(u2) = a(u; + u2) € a(U), since uy + u2 € U, 
and aa(u;) = a(au,) € a(U), since au; € U. 


12.7 It is. As2-0+3-0+7-0 = 0, we see that (0,0,0) € W. Suppose that 
(a,, b, C1), (ao, bo, C2) € Wanda € F. Then 2(a; +a2)+3(b) +b2)+7(cy +02) = 
(2a, +3b, +7c,)+ (2a. +3b.+7c2) = 0+0 = 0,80 (a; +42, b} +2, Cc; +02) € W. 
Also, 2aa, +3ab, + 7ac, = a(2a, +3b,+7c;) = a-0 = 0; thus, (aa), ab}, ac) € 
W. 


12.9 Wehavev+v+v=lv4+lv4+lv=(14+14+)v=0v=0. 


12.11 (1) As 31, 3,5) + 2(2, 1,4) — 1(7, 11, 23) = (0,0, 0), they are linearly 
dependent. 

(2) Suppose that a(1, 3, 4)+b(2, 2, 1)+c@, 6, 3) = (0, 0, 0). Thena+2b+3c = 
3a+2b+ 6c = 4a+b+3c = 0. Thus, 3a —b = a —2b = 0. We see immediately 
that a = b = 0, and hence c = 0. Therefore, the vectors are linearly independent. 
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12.13 (1) No. If they did, then as (1, 0, 2) + (2,5, 3) = (3,5, 5), the vectors are 
linearly dependent, which means that some proper subset would form a basis for Q?. 
But Q? is 3-dimensional over Q, so this is impossible. 

(2) Yes. We claim that the vectors are linearly independent. If a(1,0,2) + 
b(2, 3,5) + c(0,0,4) = (0,0, 0), we see immediately that b = 0, from which 
it follows that a = 0 and then c = O. Thus, we can add vectors to this set to find a 
basis for Q>. But again, we are in a space with dimension 3, so no more vectors can 
be added. Therefore, the vectors span the space. 


12.15 If the field is C, we can see that every matrix can be written in a unique and 


; : Poh. 10 01 00 00 
obvious way as a linear combination of (( ) : i 0) ; ({ : C 7) so these 


matrices form a basis and the dimension is 4. Working over R, we would also need 


iO O07 00 00 F Seas 
€ 0) , € 5) ; ({ ‘a ; ic a) so the dimension is 8. 


12.17 Let dim V = n. If n = 0, then V = {0} and the only possible subspace is 
{0}, so there is nothing to do. So assume that n > 1. If W = {0}, then again, there 
is nothing to do. So assume that there exists 0 A w; € W. Then wv is, by itself, 
linearly independent. If w; spans W, then we have a basis for W. If not, then there 
exists a w2 € W such that w2 is not a scalar multiple of w;. But now w, and w are 
linearly independent. If they span W, we have a basis. Otherwise, find w3; € W such 
that w3 is not a linear combination of w; and w2. Repeat this procedure. We cannot 
possibly go beyond w,,, as V cannot have n + | linearly independent vectors. Thus, 
W has a basis consisting of at most n elements, so dim W < dim V. If W  V, then 
we can add to the basis for W to obtain a basis for V, which means we must have 
dim W < dimV. 


12.19 Suppose that a;a(v,) + a2a(v2) +--+ +a,a(v,) = 0, for some a; € F. Then 
a(ajv; +--++an,Vv,) = 0. As @ is one-to-one, a,v] +-+:+d,V, = 0. But the v; are 
linearly independent. Thus, a; = --- =a, = 0. 


12.21 Leta = /5 + J7. Then a? = 12 + 2,/35, so (a2 — 12)? = 140. Thus, a 
satisfies f(x) = x4+—24x?+4, We must show that f (x) is irreducible over Q. If it has 
a root in Q, then by the Rational Roots Theorem, the root must lie in {+1, +2, +4}. 
But none of these work. The only other possibility is that f(x) is the product of two 
polynomials of degree 2. By Theorem 11.4, they may be assumed to be in Z[x]. Up 
to a factor of —1, and noting that there is no x* term in f (x), the factorization must 
be (x? + bx + c)(x? — bx +d), for some b, c,d € Z. As there is no x term in Ff), 
either b = 0 orc = d. If b = 0, we have c+ d = —24 and cd = 4. No integers can 
possibly satisfy these equations. So, assume that c = d. We are left with the cases 
(x? + bx + 2)(x? — bx + 2) and (x? + bx — 2)(x? — bx — 2), for some integer b. 
These possibilities yield, respectively, 4 — b? = —24 and —4 — b? = —24. Neither 
of these equations has a solution in Z. 


12.23 Suppose that [K : F] = n. Ifa € K, then 1,a,a’,...,a" are linearly 
dependent over F', by Lemma 12.1. Thus, there exist b; € F, not all zero, such that 
bo t bya + boa? +--+» +b,a" = 0. That is, a is a root of by + Bix +--+ + b,x". 
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12.25 Let L = Baar F,,. As 1 € Fi, we have 1 € L. Suppose that r,s ¢ L. Then 
r € Fy, 5 € F,, for some m,n € N. Letting k be the larger of m and n, we have 
r,s € Fy. Thus,r—s eR CLandifs £0,rs'e€R CL. 


12.27 Asa € F(a) and F(a) is a field, we must have a” € F(a). Also, F € F(a). 
As F(a”) is the intersection of all subfields of K containing F and a’, it follows that 
F(a’) © F(a). For the second part, let F = Q and a = i. Then Q(a*) = Q(-1) = 
Q, but Q(a) contains 7, so the fields are different. 


12.29 The minimal polynomial of a is irreducible over C. By the Fundamental 
Theorem of Algebra, this minimal polynomial has degree 1, and must therefore be 
x —a € C[x]; thus, a € C. 


12.31 Note that f(x) = x* + x + 1 is irreducible over Z7. (It has degree 3 and no 
roots in Z7.) Thus, F = Z[x]/(f (x)) will work. Letting a = x + (f (x)), we know 
that the elements of F are the linear combinations of 1, a and a? over Z7. Also, a is 
aroot of f(x), soa? = —a — 1 = 6a + 6 and a* = (6a + 6)a = 6a? + 6a. Thus, 
(a2 +5a+4)(3a? + 6) = 3a* +. a? + 4a? + 2a + 3 = 3(6a? + 6a) + (6a + 6) + 
4a? + 2a+3=a74+5a+2. 


12.33 If a and b are any two roots of x* — 2, then (ab—!)? = 1, so ab“! is one of 
the roots of x? — 1. One such root in C is 1 and another is w. Also, (w”)? = 1, so 
w” is the third complex root of x3 — 1. Thus, since Q(/2, q@) must contain A/D: lo 
and w”, we see that it contains every root of x* — 2; in particular, x* — 2 splits over 
Q(./2, w). On the other hand, if x? — 2 splits over any subfield, then that subfield 
would have to contain all three roots, namely, J2, ws/2 and w/2. As it is a field, 
this means it must contain w as well, so it is all of Q(/2, @). 


12.35 Note that Q(/2) isa splitting field of x? —2 over Q. (Both roots, J/2and —/2 
are in the field, and would have to be in any splitting field.) As an automorphism a 
must map the identity to the identity, we see immediately that wa(c) = c forallc € Z. 
Similarly, ifm,n € Zwithn > 0, thenm = a(m) = a(n(m/n)) = na(m/n). Thus, 
a(c) = c forall c € Q. By the preceding exercise, a(./2) must be a root of x? — 2; 
in particular, a(/2) €E {./2 : 4/2}. In the former case, a is the identity function. In 
the latter case, a(a + b./2) =a — by? for all a,b €Q. By Lemma 12.4, this is an 
automorphism. 


12.37 Let K bea splitting field for f(x) over F. Say that in K [x], we have f(x) = 
a(x—ad\)(x—az)---(x—a,). Then g(x) = a(x+1—a;)(x+1—az)--- (x+1-—a,) = 
a(x — (a, — 1))(x — (a2 — 1))- ++ (x — (@ — 1)). Since the a; lie in K, so do the 
a; — 1; thus, g(x) splits over K. Furthermore, for g(x) to split, all of the a; — 1 must 
be present, and hence so must all of the a;. Thus, we cannot make K any smaller 
and have g(x) split, so K is a splitting field for g(x). Showing that splitting fields 
for g(x) must be splitting fields for f(x) involves a similar argument. 


12.39 If |F'| = p”, for some prime p and positive integer n, then F has one proper 
subfield for each integer m, 1 < m <n, with m|n. The first value n that works is 6, 
so the smallest such field has order 2° = 64. Specifically, it is the splitting field of 
x — x over Zp. 
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12.41 Leta € K be a root of f(x). Then [Zs(a) : Zs] = 3. If all roots of f(x) 
lie in Zs(a), then K = Zs(a), and |K| = 5°. Otherwise, in Zs(a)[x], we have 
f(x) = (« — a)g(x), where g(x) is an irreducible polynomial of degree 2. Letting 
b be a root of g(x) in K, we see that [Zs(a,b) : Zs(a)] = 2. Furthermore, in 
Zs(a,b), the polynomial f(x) splits into linear factors, so K = Zs(a, b). Now, 
[K : Zs] =[K : Zs(a)][Zs(a) : Z5] = 2-3 = 6, and |K| =5°. 


12.43 Every field of characteristic 0 is perfect, so char F = p, for some prime 
p. The fact that f(x) = dy + apx? + +++ + Gnpx'? follows exactly as in the 
proof of Theorem 12.16. Suppose that all of the a; are algebraic over the prime 
subfield, (an isomorphic copy of) Z,. Then [Zp(ao) : Zp] < oo. Also, ap, is 
algebraic over Z,, and hence over Z,(do), 80 [Zp (do, 4p) : Zp(ao)] < oo. Thus, 
[Zp (ao, 4p) : Zp] = [Zp(ao, ap) : Zp(ao) [Zp(ao) : Zp] < oo. In the same way, 
[Zip (do, Ap, -+-,Amp : Zp] < 00, which means that Zp(ao, dp, ..., Amp) 18 a finite 
field, and hence perfect. If f(x) is irreducible over F, it is surely irreducible over 
Zp(ao, --+, Amp). An irreducible polynomial over a perfect field cannot have multiple 
roots in any extension field. 


12.45 If it were cyclic, it would be infinite cyclic. But note that —1 € U(F), and 
—1 has order 2. An infinite cyclic group has no such element. 


12.47 Let F be the splitting field of x!° — x over Zs. We know that it has order 
125. Let f(x) € Zs[x] be an irreducible factor of x! _ yx Ifa € F is a root of 
Ff (x), then [Zs (a) : Zs] = deg(f(x)). But Zs(a) is a subfield of F'. A subfield of a 
field of order 5° can only have order 5 or 5°. Thus, deg(f (x)) = 1 or 3. 


Problems of Chapter 13 


13.1 WKHWUHDVXUHLVEXULHGWZHQWBSDFHVQRUWKRIWKHSDO- 
PWUHH 


13.3 We need k to be relatively prime to 26. If it is not, then letting d = (26, k), we 
see that both 0 and 26/d will be encrypted as 0, so decryption will be impossible. 
On the other hand, if (k, 26) = 1, then k € U(26), so we can decrypt by multiplying 
by k~!. (fk = 1 (mod 26), then multiplying by k does not change the text at all, 
so it would be reasonable to rule out this key as well.) 


13.5 JGVSHNEGJESCRPPRBSXBPPVGHBSJKEHXVT 
13.7 KXTNRHIOQJHVKWNKSVNHSWOXCLFAAMJSKSBO 


13.9 Writing n = pq, the smaller of p and q must certainly be less than ./n, 
so we only need to try primes up to 44. We discover that p = 37 and q = 53. 
Thus, g(r) = 36-52 = 1872. To find d, we use the Euclidean algorithm. In 
particular, 1872 = 43(43) + 23; 43 = 23(1) + 20; 23 = 20(1) + 3; 20 = 3(6) + 2; 
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3 = 2(1)+1;2 = 1(2)+0. Thus, 1 = 3(1)+2(—1) = 301)+(20(1)+3(—6))(-1) = 
20(—1) + 3(7) = 20(—1) + (23(1) + 20(—1))(7) = 23(7) + 20(—8) = 23(7) + 
(43(1)+23(—1))(—8) = 43(—8)+23(15) = 43(—8) + (18721) +43 (—43)) 15) = 
1872(15) + 43(—653). Therefore, 43(—653) = | (mod 1872). As we need d to be 
positive, adding 1872, we get d = 1219. 


13.11 We must break our message into blocks of length 2. As we have an odd 
number of letters, we add a Q to the end. Then AL is 0011, GE is 0604, BR is 0117 
and AQ is 0016. Next, 11!4? = 5581 (mod 17399), 604!49 = 2315 (mod 17399), 
117!*? = 4926 (mod 17399) and 16!4? = 9527 (mod 17399), so our encrypted 
message consists of the four numbers 5581, 2315, 4926 and 9527. 


13.13 Note thatn = 103-179 = 18437 and y(n) = 102-178 = 18156. To findd, we 
apply the Euclidean algorithm. Namely, 18156 = 151(120) + 36; 151 = 36(4) +7; 
36 = 7(5) + 1; 7 = 1(7) + 0. Thus, | = 36(1) + 7(—5) = 36(1) + 51d) + 
36(—4))(—5) = 151(—5) + 36(21) = 151(—5) + (181561) + 151(—120))(21) = 
18156(21) + 151(—2525). Therefore, —2525e = 1 (mod g(n)). As we need d to 
be positive, we add 18156 and get d = 15631. We now calculate 2469! = 1514 
(mod 18437), 7093!°°! = 1124 (mod 18437), 14773!°°! = 1314 (mod 18437), 
10900!563! = 1208 (mod 18437) and 143! = 11 (mod 18437) (which we 
remember to write as 0011). So, our message is 15141124131412080011, which 
translates to POLYNOMIAL. 


Problems of Chapter 14 


14.1 Construct the line through A and B. Next, construct the circle centred at A 
with radius AB. Say it meets the line at B and C. Construct the circle centred at B 
with radius AB, and say that it meets the line at A and E. Then the distance from C 
to E is 3, so if we construct the perpendicular bisector of C E, and it meets the line 
at D, then the distance from C to D is 1.5. 


14.3 Construct the circle centred at A with radius AB and the circle centred at 
B with radius AB. Let C be either of the intersection points of these circles. By 
construction, the three sides of ABC have the same length. 


14.5 We begin by constructing the line through B and A. Next, construct the circle 
centred at B with radius BC. It meets the line through B and A at two points; let E 
be the one of those points on the same side of B as A. Then replacing A with E, we 
may assume that in our original angle, A and C were equidistant from B. Construct 
the line through A and C, then construct the perpendicular bisector of AC. These 
two lines meet at the desired point, D. 
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14.7 Proceeding as in the solution to Exercise 14.3, construct a point E such that 
ABE is an equilateral triangle. It must lie on the circle. Now do the same thing with 
A and E; that is, construct a point C (the one that is different from B) such that AEC 
is an equilateral triangle. Again, C must be on the circle. Now do the same with A 
and C, and construct a new point F on the circle such that AC F is an equilateral 
triangle. Performing the same construction for A and F’, we obtain a new point D 
on the circle such that A FD is an equilateral triangle. And then in the same way, we 
can construct a point G on the circle such that ADG is an equilateral triangle. But 
now BEC FDG is a regular hexagon, so BC D is an equilateral triangle. 


14.9 As the constructible numbers form a field, if a + b were constructible, then 
a+b—a = bwouldalso be constructible, giving a contradiction. Similarly, if ab were 
constructible, then the field of constructible numbers would include a~!ab = b. Now, 
if —b were constructible then b would be constructible as well, so letting c = —b, 
we see that b + c = 0 is constructible. On the other hand, if we let c = b, then we 
get b+ c = 2b. If this were constructible, then since 1/2 is also constructible, we 
would find that b would be constructible as well. 


14.11 (1) Yes. As all integers are constructible and the field of constructible numbers 
is closed under the taking of square roots of its nonnegative elements, we see that 
2+ V5 — V3 is constructible, and then we can take the square root twice to obtain 
this element. 

(2) No. Once again, we know that /3 is constructible, but /3 is a root of x3 — 3. 
By Eisenstein’s criterion, this polynomial is irreducible over Q, so 3 has minimal 
polynomial x* — 3. As the degree is not a power of 2, \/3 is not constructible. By 
Exercise 14.9, the sum of a number that is constructible with one that is not is not 
constructible. 


14.13 By Eisenstein’s criterion, this polynomial is irreducible over Q. Thus, it is 
the minimal polynomial of a over Q. As the degree is not a power of 2, a is not 
constructible. 


14.15 We will prove a stronger statement, that an angle of 2/6 can be constructed. 
We are given the points (0, 0) and (1, 0). To obtain such an angle, we only need to 
construct the point (cos(z/6), sin(zr/6)) = (/3/2, 1/2). But by Theorems 14.1 and 
14.2, the numbers /3/2 and 1/2 are constructible, so the point is constructible. 


14.17 There is nothing todo forn = 1. Whenn = 2, we have cos(20) = 2 cos*(@)— 
1. For the n = 3 case, we look to the proof of Theorem 14.6, and see that cos(3@) = 
4cos*(@) — 3.cos(@). To handle the remaining cases, we simply note that cos(@) = 
cos(—0) = cos(2m — 0). Thus, cos(40@) = cos(82/7) = cos(67/7) = cos(30) and, 
similarly, cos(50) = cos(20) and cos(60) = cos(@). 
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14.19 Let D = A. We know that the number 2 is constructible, which means 
that the point EF = (/2, 0) is constructible. Construct the circle centred at A and 
passing through C. It will meet the x-axis (which we can construct) at (m, 0), where 
m is the distance from A to C. Thus, the number m is constructible, and so mV/2 
is constructible. In particular, we can construct the point (m/2, 0). Now draw the 
circle centred at A and passing through (m4J/2, 0). It intersects the line through A 
and C at a point F, where the distance from A to F is m2 and F is on the same 
side of A as C. The triangles ABC and DEF are similar. As the side lengths are 
increased by a factor of \/2, the area is increased by a factor of 2. 
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